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NOVEL BACTERIAL GENES AND PROTEINS THAT ARE ESSENTIAL FOR 
CELL VIABILITY AND THEIR USES 



10 Throughout this application various publications are referenced. The disclosures of these 

publications in their entireties are hereby incorporated by reference into this application in 
order to more fully describe the state of the art to which this invention pertains. 



FIELD OF THE INVENTION 



The present invention relates generally to nucleotide sequences, and polypeptides 
encoded by the sequences, that are essential for bacterial viability, and to methods of 
using the nucleotide and polypeptide sequences. 

20 BACKGROUND OF THE INVENTION 



25 



30 



Bacterial genera, such as Streptococcus, Staphylococcus, Pseudomonas, Yersinia, 
Salmonella, and Enter obacter 4 , are the cause of numerous afflictions in humans and 
animals. Bacterial infection can lead to serious health conditions, including pneumonia, 
osteomyelitis, meningitis, sinusitis, otitis, cystitis, and even food poisoning. Typically, 
these infections can be treated with standard antimicrobial agents such as antibiotics. 
However, the emergence of pathogenic bacterid strains that are resistant to antibiotics 
has risen alarmingly in the past two decades. This situation has created an urgent need 

* 

for the development of new antimicrobial agents. 

One strategy for developing new antimicrobial agents is to identify bacterial gene 
sequences that encode gene products that are essential for bacterial cell viability and 
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develop and/or identify agents which inhibit the function of the gene product. DNA 

P 

* ■ i 

sequencing technology has advanced from sequencing one gene at a time to sequencing 

entire genomes, the sum of all genes in an' organism. With the recent arrival of bacterial 

i ■ i 

• i ■ . ■ - 

genomic information, it is now possible to compare multiple bacterial genomes in an 

attempt to identify genes that encode conserved gene products. In this manner, one 

skilled in the art may identify a set of conserved bacterial genes* including a subset of 

genes that are essential for bacterial cell viability. The essential gene is then used as a 

starting point to . develop therapeutic agents that' inhibit or inactivate the product' of the 

i '' 
essential gene. t 

The availability of DNA sequence information for multiple microbial genomes is. a recent 
development The public release of the first complete genome; Haemophilus influenzae 
(Fleischmann, R.D., et al. 1995 Science 269:496-512 ), was followed in rapid succession 
by a number of public and private genome sequencing programs. Presently, some 20 
completely sequenced bacterial genomes have been published, and over 100 other 
sequencing projects are underway (Blattner, F.R., et al., 1997 Science 277:1453-74; 
Ferretti, J.J., et al., 1997 Adv Exp Med Biol 418:961-963; Koonin, E.V., et al., 1996 
Methods Enzymol 266:295-322). Analyses of these data indicate that approximately 
46% of putative bacterial genes are of unknown function having no attributable function. 

* 

Others have- pursued various strategies to identify bacterial genes that are essential for 
viability. These strategies include: identifying genes that are expressed by the bacteria 
when present in the infected host (Hensel," M., et al., 1995 Science 269:400-3), 
identifying essential genes by isolating temperature sensitive mutants (Schmid, M.B., et 
al., 1998 Curr Opin Chem Biol 2:529-34), and identifying genes in pathways known 
from prior physiological studies to be essential (Skarzynsld, T. et al., 1996 Structure 
1996 4:1465-74) 



4 

There continues to be a need to identify bacterial genes that encode gene products that are 
30 essential for cell viability, such as cell replication, growth, and survival. These genes and 
their encoded gene products can be used as a starting point towards identifying agents 
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that inhibit functions essential for cell viability, thereby causing bacterial cell stasis or 
death (e.g., antibacterial agents). 

. I ' ■ , * ' ' ' ' *'.''' ' , 

I i . | i ' ' ' ' I 

• ■ ' 1 . ■ , '' ' i 

, i - . 1 1 

The present invention provides experimental identification of novel, conserved essential 

h 1 1 

5 genes (ceg) from bacteria and their encoded protein products. The ceg genes are 
considered essential to cell viability because disruption of an endogenous ceg gene results 

i ' 

in lethality of a bacterial cell (e.g., as determined by failure to recover; viable 
chloramphenicol-resistant colonies, as described herein). . Thus, the gene products 
encoded by these genes are potentially valuable targets for chemotherapeutic intervention 
10 of bacterial infections. 

The ceg nucleotide sequences of the invention were obtained by large-scale 
computational comparisons of multiple genome sequences to identify conserved protein 
coding regions, followed by gene disruption to identify cegs. The conservation of protein 
15 sequences in many cases is believed to reflect the higher level conservation of common 
biochemical pathways essential for bacterial function and viability. 

* 

SUMMARY OF THE INVENTION 

20 The acronyms "CEG" and "ceg 5 * stand for Conserved Essential Gene. For convenience, 
the italicized term ceg refers herein to ceg nucleotide sequences. The capitalized term CEG 
refers herein to CEG polypeptide sequences. 

* 

Embodiments of the ceg nucleotide sequences and the CEG polypeptide sequences are 
25 designated CFEs which stands for CEG For Expression. The CFEs are polypeptides 
resulting from expression of the ceg nucleotide sequence. 

* * 

The„present invention provides isolated nucleotide sequences of conserved essential 
genes from bacteria, designated ceg. The invention also provides recombinant nucleic 
30 acid molecules including Hie ceg sequences of the invention, and methods of uses thereof. 
Examples of nucleic acid molecules having ceg sequences are described in SEQ ID 

3*. 
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NOS.: 1-113. The invention further provides isolated polypeptides and recombinant 
polypeptides having the CEG sequences of the invention, and methods of uses thereof 

I ■ , . • ' ' 'I 

Examples of polypeptides having CEG sequences are described in SEQ ID NOS.: 114- 

f „ ^ * ' 

226. ' , ' ' . 

5 

j , 
• t 

The ceg sequences of the present invention are DNA or RNA. Further , ttie invention 

► i 1 

are identical or nearly identical (e.g., similar) with 
the ceg sequences of the invention. The invention additionally provides polynucleotide 

* * I.-."' 

sequences that hybridize under stringent conditions to the ceg sequences of the invention. 

10 A further embodiment provides polynucleotide sequences which are complementary to 

the ceg sequences of the invention. Yet another embodiment provides ceg nucleic acid 

molecules that are labeled with , a detectable marker. Another embodiment provides 

recombinant nucleic acid molecules, such as a vector or a fusion molecule, including the 

} i • 
ceg sequences of the invention. , 

is ' . • ; ' 

The present invention provides various ceg sequences, fragments thereof having essential 
gene activity, and related molecules such as antisense molecules, oligonucleotides, 
peptide nucleic acids (PNA), fragments, and portions thereof 

20 The present invention relates to the inclusion of the polynucleotides encoding CEG gene 
products, such as CEO polypeptides, in an expression vector which can be used to 
transform host cells or organisms. Such transgenic hosts are useful for the production of 
CEG gene products for the development of antibacterial agents such as antibiotics. 

25 The invention further provides substantially purified CEG gene products, and uses 
thereof. 

The invention also relates to pharmaceutical compositions comprising antisense 
molecules capable of disrupting expression of ceg sequences, agonists, antagonists or 
30 inhibitors of CEG gene products, and antibodies reactive against the CEG polypeptides. 

4 



>ISDOCID:<WO 0149721A2 I > 



WO 01/49721 PCT/US00/35604 

These compositions are useful for preventing the growth or survival of bacteria, for 

* 

example, in the treatment of conditions associated with bacterial infections. 



i > 



BRIEF DESCRIPTION OF THE FIGURES 

5 

§ 

Figure 1: A schematic representation of the gene disruption assay, as described an Example 

i - ■ 1 ■ ' i 

3, infra. A) A recombinant vector undergoing homologous recombination with the host 
genome. B) The result of homologous recombination. 

i 

10 Figure 2: A schematic representation of the polarity test for operons, as described in 
Examples 2 and 3, infra. A) The recombinant vector undergoing homologous 
recombination with the host genome. B) Case 1 : one possible result of homologous 
recombination; the downstream Gene B has an independent promoter. C) Case 2; another 
possible result of homologous recombination; the downstream Gene B does not have an 

15 independent promoter. 

Figure 3: Purification of 2CFE 75, as described in Example 6, infra. A) Fractionation 
profile of 2CFE 75 eluted from a Ni-NTA column. B) Gel electrophoresis of pooled 
fractions of CFE 75. C) Non-denaturing gel electrophoresis to determine oligo form of 
20 2CFE 75. 



Figure 4: Fractionation profile of 2CFE 3 eluted from a hydroxyapatite column, as described 
in Example 7, infra. 

25 Figure 5: The biosynthesis pathway of Coenzyme A which starts with phosphorylation of 
pantothenate, 

* 

Figure 6: Circular dichroism spectra of 2CFE 101 and 103, as described in Example 10, 
infra. A) Circular dichroism spectra of 2CFE 101 and 103 at 25 degrees C. B) Circular 
30 dichroism thermal melt spectra of 2CFE 101 and 103 at a range of zero to 100 degrees C. 
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Figure 7: Circular dichroism spectra of aggregate and monomer pools of 2CFE 101 and 103, 
as described in Example 10, infra. A) Circular dichroism spectra of aggregate and monomer 

pools of 2CFE 101 and' 103 at 25 degrees C. B) Circular dichroism thermal melt spectra of 

< * i 

* i * , , • 

aggregate and monomer pools of 2CFE 101 and 103 at a range of zero to 100 degrees C. * * 

5 

Figure 8: Absorbance spectra ofpantothenate-'dependent production of ADP, as described in 
Example 1 0, infra. - 

i 1 i 

1 i 1 * 

Figure 9: The results of size exclusion chromatography and gel electrophoresis showing the 
10 oligomeric forms of 2CFE 21 and 39, as described in Example 11, infra. Lanes 1-6 contain 
.2CFE 21, lane 7 is a molecular weight marker, lanes 3-10 contain 2CFE 39. 

Fieure 10: Gel electrophoresis of a helicase reaction using 2CFE 21 and 39 and radiolabeled 
synthetic Holliday Junction template, as described in Example 1 1 5 infra. Lane 1 contains 

i 

15 the synthetic Holliday Junction template; lane 2 contains the synthetic duplex; lane 3 
contains a single-stranded template; lane 4 contains the helicase reaction using 2CFE 39; 
lane 5 contains the helicase reaction using 2CFE 21; lanes 6-8 contain the helicase reaction 
using 2CFE 39 and 21 at varying concentrations (e.g., 1, 2, and 3 |liM each); and lane 9 
contains the helicase reaction using 2 jjM each 2CFE 39 and 21 in the presence of ethidium 

* 

20 bromide. 

' Figure 1 1 : A graph depicting the results of the helicase reaction which were monitored by . 
measuring the unquenching of the Holliday Junction templates with time, as described in 
Example 1.1, infra. 

25 

Figure 12: Capillary electrophoresis results of 2CFE 8 with and without ssDNA, as 
described in Example 12, infra. A) Electropherogram of 2CFE 8 alone; B) 
Electropherogram of 2CFE 8 in the presence of a 32-nucleotide single-stranded oligomer. 

.30 Figure 13: Gel mobility shift assay of 2CFE 8, and 2CFE 8 in the presence of a single- 
stranded 32-mer ? as described in Example 12, infra. . A) An ethidium bromide-stained, 

6 
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native, polyacrylamide gel containing 2CFE 8, and 2CFE 8 in thfe presence of a 32-mer. B) 
The same native, polyacrylamide gel stained with Coomassie. 

. ■ - i 

Figure 14: The N-acetyl glucosamine pathway putatively mediated .by 2CFE 3 and 2CFE 
86, as described in Example 13, infra. 

i ' ' ' 

* .i 

r , 

Figure 15: Capillary electrophoresis results of 2CFE 3 with and without putative substrates, 
as described in Example 13, infra.. A) Electropherogram of 2CFE 3 with .and without 
glucosamine- 1 -phosphate. B) Electropherogram of 2CFE 3 with and without D-glucose-1- 
phosphate. C) Electropherogram of 2CFE 3 alone, 2CFE 3 and glucose- 1 -phosphate, and 
2CFE 3 and glucose-6-phosphate. D) Electropherogram of 2CFE 3 alone or in the presence 
of glucosamine- 1 -phosphate, glucosamine-6-phosphate, D-glucose, D(+) galactose, and a- 
D-glucose-1 -phosphate. 

Figure 16: Capillary electrophoresis results of FITC-derivitized 2CFE 3 polypeptide with 
and without D-glucosamine-6-phosphate (substrate) to produce the product D-glucosamine- 
1 -phosphate, using laser-induced fluorescence, as described in Example 13, infra. 
Electropherogram of D-glucosamine-6-phosphate (putative substrate), 2CFE 3 reacted with 
D-glucosamine-6-phosphate, and the product glucosamine- 1-phhosphate. 

Figure 17: Gel electrophoresis of 2CFE 86 eluted from an NirNTA column, as described in 
Example 13, infra. 

Figure 18: HPLC analysis of a coupled reaction including 2CFE 3, 2CFE 86, and D- 
glucosamine-6-phosphate to produce the product, UDP-N-acetylglucosamine-1 -phosphate 
(UDPAG), as described in Example 13, infra. 

* 

Figure 19: A fatty acid biosynthesis pathway. 
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Figure 20: Size exclusion chromatography to determine the .molecular weight and 
oligomeric form of 2CFE 34, as described in Example 14, infra.. Selected eluted samples 

were sized by gel electrophoresis. ' * • 1 , • 

i ■ ■ 

5 Figure 21: Gel electrophoresis of 2CFE 41 eluted from a Ni-NTA column, as described in 
Example 15, infra. ■ ; ' ' 

^ 

■ ■ * i 

Figure 22: Capillary electrophoresis results of 2CFE 40, 41, and 46, as described in Example 

■ , ■ i ■ .... 

15, infra. 



4 * 

Figure 23: Depicts a schematic diagram of a ligand which binds 2CFE 34. The ligand is 2- 
phenyl-N-(3 corboxyl-4hydroxyphenyl) azabicyclo [4.3.0] riona-2, 8-diene. 

i 

! , 

Figure 24: Depicts a schematic diagram of a ligand which binds 2CFE 43. The ligand is N- 
1 5 (3, 5-dinitrobenzyl)-7-trifluoromethyl benza diaza furanolactone. 

i 

i 

Figure 25: Depicts a schematic diagram of a ligand which binds 2CFE 43. The ligand is 2- 
amino (N-para-methylphenyl sulfonamide)-3-phenylpropianic acid. 

20 Figure 26: A nucleic acid sequence of 2CFE1 deposited with the American Type Culture 
Collection as ATCC. designation on December 20, 2000. 

Figure 27: A nucleic acid sequence of 2CFE2 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 

Figure 28: A nucleic acid sequence of 2CFE3 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 29: A nucleic acid sequence of 2CFE4. deposited with the American Type Culture 

* 

30 Collection as ATCC designation on December 20, 2000. 
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Figure 30: A nucleic acid sequence of 2CFE5 deposited with the American Type Culture 



Collection as ATCG designation , • on December 20, 2000. 

., .* - • , , 

Figure 31: A nucleic acid sequence of 2CFE6 deposited with the American Type Culture 

Collection as ATCC designation ' , on December 20, 2000. , 

i' . i 

I'''". ' . ' 

' " * t 

Figure 32: A nucleic acid sequence of 2CF!E7 deposited with the American Type Culture 
Collection as ATCC dbsignation . on December 20, 2000. 

p * i 

' i i * 

Figure 33: A nucleic acid sequence of 2CFE8 deposited with the American Type Culture 
Collection as ATCC designation . on December 20, 2000. 

Figure 34: A nucleic acid sequence of 2CFE9 deposited with the American Type Culture 
1 5 Collection as ATCC designation on December 20, 2000. 

Figure 35: A nucleic acid sequence of 2CFE10 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

20 Figure 36: A nucleic acid sequence of 2CFE1 1 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

■ 

Figure 37: A nucleic acid sequence of 2CFE12 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 

Figure 38: A nucleic acid sequence of 2CFE13 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 39: A nucleic acid sequence of 2CFE14 deposited with the American Type Culture 
30 Collection as ATCC designation on December 20, 2000. 
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Figure 40; A nucleic acid sequence of 2CFEi5 deposited with the. American Type Culture 
Collection as ATCC designation on December 20, 2000. 

i 

i . i 

Figure 41 : A nucleic acid sequence of 2CFE16 deposited with the American Type Culture 

5 Collection as ATCC designation , on December 20, 2000. 

< t 

* h t * i 

« * + ! 

Figure 42: A nucleic acid sequence of 2CFE17 deposited with thes American Type Culture 

Collection as ATCC designation ■ on December 20, 2000^ 

i I 

i 

* ' I r < 

10 Figure 43: A nucleic acid sequence of 2CFE19 deposited with the American Type Culture 
Collection as ATCC designation . ' on December 20, 2000. 

r 

■ i ■ 

Figure 44: A nucleic acid sequence of 2CFE21 deposited with the American Type Culture 

' * I . 
Collection as ATCC designation - ! on December 20, 2000. , 

15 

Figure 45: A nucleic acid sequence of 2CFE24 deposited with the American Type Culture 

r 

Collection as ATCC designation on December 20, 2000. 

Figure 46: A nucleic acid sequence of 2CFE25 deposited with the American Type Culture 
20 Collection as ATCC designation on December 20, 2000. 

4 * 

Figure 47: A nucleic acid sequence of 2CFE26 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 * Figure 48: A nucleic acid sequence of 2CFE27 deposited with the American Type Culture 
Collection as ATCC designation • on December 20, 2000. 

Figure 49: A nucleic acid sequence of 2CFE28 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

30 
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Figure 50: A nucleic acid sequence of 2CFE29 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 51: A nucleic acid sequence of 2CFE30 deposited with the American Type Culture 

i 1 

Collection as ATCC designation • ' ' on December 20, 2000. 

■ 

i' . . ■ * t ■ 
Figure 52: A nucleic acid sequence of 2CFE3 1 deposited with the American Type Culture 

Collection as ATCC designation '' on December 20, 2000. 

* ■ ■ h ■ 

♦ * * 

Figure 53: A nucleic acid sequence of 2CFE32 deposited with the American Type Culture 
Collection as ATCC designation ' ■ on December 20, 2000. 

■ i ■ • , • " 

Figure 54: A nucleic acid sequence of 2CFE33 deposited with the American Type Culture 

Collection as ATCC designation ' on December 20, 2000. 

Figure 55: A nucleic acid sequence of 2CFE34 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 56: A nucleic acid sequence of 2CFE35 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 57: A nucleic acid sequence of 2CFE36 deposited with the American Type Culture 
Cpllection as ATCC designation on December 20, 2000. 

Figure 58: A nucleic acid sequence of 2CFE37 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 59: A nucleic acid sequence of 2CFE38 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 
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Figure 60: A nucleic acid sequence of 2CFE39 deposited with the American Type Culture 
Collection as ATCC designation . on December 20, 2000. 

Figure 61 : A nucleic acid sequence of 2CFE40, deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 



Figure 62: A nucleic acid sequence of 2CFE41 deposited with the American Type Culture 
Collection as ATCC designation ' on December 20, 2000. 

1 0 Figure 63 : A nucleic acid sequence of 2CFE42 deposited with the American Type Culture 
Collection as ATCC designation ■ on December 20, 2000. 

i 

■ i ■ ' 

Figure 64: A nucleic acid sequence of 2CFE43 deposited with the American Type Culture 
Collection as ATCC designation _. on December 20, 2000: 

15 

i 

Figure 65: A nucleic acid sequence of 2CFE44 deposited with the American Type Culture" 
Collection as ATCC designation on December 20, 2000. 

Figure 66: A nucleic acid sequence of 2CFE45 deposited with the American Type Culture 
20 Collection as ATCC designation on December 20, 2000. 

Figure 67: A nucleic acid sequence of 2CFE46 deposited with the American Type Culture 
Collection as ATCC designation : on December 20, 2000. 

25 Figure 68: A nucleic acid sequence of 2CFE47 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000 . 

Figure 69: A nucleic acid sequence of 2CFE48 deposited with the American Type Culture 
Collection as ATCC designation . on December 20, 2000. 

30 
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Figure 70; A nucleic acid sequence of 2CFE49 deposited with the American Type Culture 

* i 1 

i 

Collection as ATCC designation on December 20, 2000. 

i ■ * ■ i , - 1 1 * - ■ i 

i ■ , 'i 

Figure 71: A nucleic acid sequence of 2CFE50 deposited with the American Type' Culture 
Collection as ATCC designation on December 20, 2000. 

I L 

I 

I ' 

Figure 72: A nucleic acid sequence of ;2CFE51 deposited with the American Type Culture 
Collection as ATCC designation ■ on December 20, 2000. ' 



10 Figure 73: A nucleic acid sequence of 2CFE52 deposited with the American T^p'e Culture 
Collection as ATCC designation on December 20, 2000. 

/ 

Figure 74: A nucleic acid sequence of 2CFE53 deposited with the American Type Culture 
Collection as ATCC designation 1 > . on December 20, 2000, 

15 

Figure 75: A nucleic acid sequence of 2CFE54 deposited with the American Type Culture 
Collection as ATCC designation ._ on December 20, 2000. 

Figure 76: A nucleic acid sequence of 2CFE55 deposited with the American Type Culture 
20 Collection as ATCC designation on December 20, 2000. 

Figure 77: A nucleic acid sequence of 2CFE56 deposited with the American Type Culture 
Collection as ATCC designation ■_ on December 20, 2000. 

25 Figure 78: A nucleic acid sequence of 2CFE57 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 79: A nucleic acid sequence of 2CFE58 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

30 
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Figure 80: A nucleic acid sequence of 2CFE59 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 



Figure 81: A nucleic acid sequence of 2CFE60 deposited with* the American Type Culture 

5 Collection as ATCC designation > on December 20, 2000. 

■ 1 1 

Figure 82: A nucleic acid sequence of 2CFE61 deposited with the American Type 1 Culture 

' + i 

Collection as ATCC designation ; on December 20, 2000. 

, 1 1 j ' 

i i i ' . 

i 

10 Figure 83: A nucleic acid sequence of2CFE62 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

* 

i i 

* \ 

Figure 84: A nucleic acid sequence of : 2CFE64 deposited with the American Type Culture 
Collection as ATCC designation ! on December 20, 2000. 

15 

Figure 85: A nucleic acid sequence of 2CFE65 deposited with the American Type Culture 
Collection as ATCC designation • on December 20,,2000. 

* 

Figure 86: A nucleic acid sequence of 2CFE66 deposited with the American Type Culture 
20 Collection as ATCC designation , on December 20, 2000. 

Figure 87: A. nucleic acid sequence of 2CFE67 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 Figure 88: A nucleic acid sequence of 2CFE68 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 89: A nucleic acid sequence of 2CFE69 deposited with the American Type Culture 



Collection as ATCC designation on December 20, 2000. 



30 
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Figure 90: A nucleic acid sequence of 2CFE70 deposited with the American Type Culture 

Collection as ATCC designation on December 20, 2000. 

i i , . .' . . • ' , . -i .. . ( . . 

Figure 91: A nucleic acid sequence of 2CFE71 deposited with the American Type Culture 
5 Collection as ATCC designation on December 20, 2000. 

■ i 

• ' 1 7 „.:.■.. , 

Figure 92: A nucleic acid sequence of 2CFE72 deposited with the American Type Culture 
Collection as ATCC designation ■■ ' 1 . on December 20, 2000. 

t , , , 

■ , .■ ' .7 

t 1 

■ 

10 Figure 93: A nucleic acid sequence of 2CFE75 deposited with the American Type Culture 
Collection as ATCC designation ■ on December 20, 2000. 

Figure 94: A nucleic acid sequence of 2CFE76 deposited with the American Type Culture 
Collection as ATCC designation ! on December 20, 2000. 

15 

Figure 95* A nucleic acid sequence of 2CFE78 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 96: A nucleic acid sequence of 2CFE79 deposited with the American Type Culture 
- 20 Collection as ATCC designation on December 20, 2000. 

Figure 97: A nucleic acid sequence of 2CFE80 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 Figure 98: A nucleic acid sequence of 2CFE81 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 99: A nucleic acid sequence of 2CFE82 deposited with the American Type .Culture 
Collection as ATCC designation on December 20, 2000. 
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Figure 100: A nucleic acid sequence of 2CFJS83 deposited with the Ame^can Type Culture 

Collection as ATCC designation . on December 20, 2000. 

'I • • • < v 1 . •> • ■ . ■■ • 

Figure 101 : A nucleic acid sequence of 2CFE84 deposited with the Amierican Type Culture 

» 

5 Collection as ATCC designation on December, 20, 2000^ 

'■ ' . . . ' 

Figure 102: A nucleic acid sequence of 2CFE85 deposited with the American Type Culture 

1 1 * i • , ■ ■ 

Collection as ATCC designation ' on December 20, 2000. 

' • ' , •' . • ' 

* ' * p 

i ► * 

i 

• 1 
10 Figure 103: A nucleic acid sequence of 2CFE86 deposited with the American Type Culture 

■ '. ' 

Collection as ATCC designation , on December 20, 2000. 

• i- 

Figure 1 04: A nucleic acid sequence of 2CFE87 deposited with the American Type Culture 

* * * • * * 

Collection as ATCC designation on December 20, 2000. 

15 

Figure 105: A nucleic acid sequence of 2CFE88 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 106: A nucleic acid sequence of 2CFE89 deposited with the American Type Culture 
20 Collection as ATCC designation on December 20, 2000. 

Figure 107: A nucleic acid sequence of 2CFE90 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

25 Figure 108: A nucleic acid sequence of 2CFE91 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

Figure 109: A nucleic acid sequence of 2CFE92 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

30 
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Figure 110: A nucleic acid sequence of 2CFE94 deposited with the American Type Culture 

■ • 

Collection as ATCC designation on December 20, 2000. 

1 ' t I 

> i I 

, * I 

I 

f , I 

Figure 1 1 1 : A nucleic acid sequence of 2CFE95 deposited with the American Type Culture 
Collection as ATCC designation , on December 20, 2000. 

y 

* 1 ■ 

Figure 1 12: A nucleic acid sequence of 2CFE96 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 20001 



1 0 Figure 1 1 3 : A nucleic acid sequence of 2CFE97 deposited with the American Type Culture 
Collection as ATCC designation on December 20, 2000. 

i - 

Figure 1 14: A nucleic acid sequence of 2CFE99 deposited with the American Type Culture 

m * * 

Collection as ATCC designation on December 20, 2000. 

15 

. Figure 115: A nucleic acid sequence of 2CFE101 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 116: A nucleic acid sequence of 2CFE102 deposited with the American Type 
20 Culture Collection as ATCC designation on December 20, 2000. 

Figure 117: A nucleic acid sequence of 2CFE103 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

25 Figure 118: A nucleic acid sequence of 2CFE104 deposited with the American Type 
Culture . Collection as ATCC designation on December 20, 2000. 

Figure 119: A nucleic acid sequence of 2CFE105 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

30 
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Figure 120: A nucleic acid, sequence of 2CFE106 deposited with the American Type 
Culture Collection as ATCC designation . on December 20, 2000. 

• ' i 

Figure 121: A nucleic acid sequence of 2CFE107 deposited with the American Type 

5 Culture Collection as ATCC designation on December 20, 2000. 

. • * i 

Figure 122: A- nucleic acid sequence of 2CFE108 deposited with' the American Type 
Culture Collection as ATCC designation ■ on December 20, 2000. 

■ 

10 Figure 123: A nucleic acid sequence of 2CFE109 deposited with the American Type 
Culture Collection as ATCC designation , on December 20, 2000. 

■ 

■ i 

Figure 124: A nucleic acid sequence of 2CFE111 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

15 

Figure 125: A nucleic acid sequence of 2CFE112 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 126: A nucleic acid sequence of 2CFE113 deposited with the American Type 
20 Culture Collection as ATCC designation on December 20, 2000. 

* 

Figure 127: A nucleic acid sequence of 2CFE114 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

25 Figure 128: A nucleic acid sequence of 2CFE115 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

Figure 129: A nucleic acid sequence of 2CFE116 deposited with the American Type 
Culture Collection as ATCC designation ; on December 20, 2000. 
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I '■ 

Figure 130: A nucleic acid sequence of 2CFE117 deposited with the American Type 
Culture Collection as ATCC designation on December 20, 2000. 

* i 

I ■ . , ■ . • I 

■ .-,''(( 
* * t 

Figure 131 : Schematic structures of alkyloids which are ligaridsj for example, of 2CFE42. 
5 ■ 

DETAILED DESCRIPTION OF THE INVENTION , 

i ' 

t I 

Definitions 

.' . ' 

10 All scientific and technical terms used in this application have meanings commonly used in 
the art unless otherwise specified. As used in this application; the following words or 
phrases have the meanings specified". ! 

! ' 

As used herein, a ceg nucleic acid molecule is said to be "isolated" when the nucleic acid 
15 molecule is substantially separated from contaminant nucleic acid molecules that encode 
polypeptides r other than CEGs. Additionally, isolated nucleic acid molecule refers to any 
RNA or DNA sequence obtained from a natural source, or constructed by recombinant 
methods, or synthesized. A skilled artisan can readily employ nucleic acid isolation 
procedures to obtain an isolated nucleic acid molecule having ceg sequences. 

20 

The term "ceg" includes all isolated forms of ceg nucleotide and CEG amino acid sequences 

* 

disclosed herein. The ceg sequences encode gene products that have essential biological 
functions in bacterial cells, such as, for example, nucleotide biosynthesis, amino acid 
biosynthesis, DNA replication, RNA transcription, protein translation, DNA 

25 recombination, DNA repair, biosynthesis of cofactors (e.g., Coenzyme A), biosynthesis 
of prosthetic groups, cellular processes (e.g., chaperones, cell division, and polypeptide 
secretion), energy metabolism (e.g., pentose phosphate pathway, glycolysis, 
gluconeogenesis), fatty acid biosynthesis, cell wall biosynthesis, and/or biosynthesis of 
purines, pyrimidines, nucleosides, and nucleotides. Accordingly, the gene products of the 

30 ceg nucleotide sequences are required for viability of bacterial cells. The term "ceg" also 
includes variants having nucleotide sequence similarity to the disclosed ceg sequences, 

19 
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I ; 

including sequences isolated from various bacterial genera and species, allelic variants, 
mutant variants, and ceg variants that encode conservative and non-conservative amino acid 
substitutions. The present invention also provides for all ceg sequences 'generated by 
recombinant DNA technology, including complementary seqtiencies, ceg sequences that 
5 hybridize to the sequences of the invention at high stringency hybridization conditions, 
fusion genes comprising a ceg sequence, and codon usage variants. 1 

i 

■■'i ■ . i , 

■ + » i 

i * 

The term "essential genes" refers to a nucleotide sequence that encodes a gene product 
having a function which is required for cell viability. The term "essential protein" refers 
10 to a polypeptide that is encoded by an essential gene and has a function that is required 
for cell viability. Accordingly, a mutation that disrupts the function of the essential gene 
or essential proteins results in a loss of viability of cells harboring the mutation. 

i 

"Non-essential genes" or "non-essential proteins" refer to genomic information, of the 

i * 

15 protein(s) or RNAs encoded therefrom which, when disrupted by a mutation, do not 
result in a loss of viability of cells harboring said mutation under defined laboratory 
conditions. 

* 

As used herein, a nucleotide sequence is said to be "identical" to another reference 

« ..- 

20 sequence when both nucleotide sequences are exactly alike. 

As used herein, a nucleotide sequence is said to be "similar" to another reference 
sequence when a comparison of the two sequences shows that they have a low level of 
sequence differences. For example, two sequences are considered to be similar to each 
25 other when the percentage of nucleotides that are shared between the two sequences is 
between about 70 % to 99.99% over the entire length of the two sequences. 

As used herein an amino acid sequence is said to be "similar" to another reference 
sequence when a comparison of the two sequences shows that they have a low level of 
30 sequence differences. For example, two sequences are considered to be similar to each 
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1 i 

* r 

other when the percentage of amino acids that are shared between the two sequences may 
be between about 30% to 1 00% identity over the entire length of the two sequences. 

■ ■ . . : 

i 

As used herein, ah "allele" or "allelic sequence" is ah alternative form of the naturally- 

5 occurring ceg sequence. Alleles result from a mutation, that changes the nucleotide 

sequence, and generally produce altered mRNAs or polypeptides whose structure or 

i ■ i . , j 

function may or may not be altered. 

! , ' i ' , . • 

"Substantially purified" as used herein means a specific isolated nucleic acid or protein, 

i 

10 or fragment thereof, in which substantially all contaminants (i.e. substances that differ 

i 

from said specific molecule) have been separated from said nucleic acid or protein. 

i 

■ i - , • 
In a host cell, an "endogenous" sequence as used herein means a nucleic acid sequence 

■ ! * I 

that is naturally-occurring and resides within the host genome. 
15 . 

t 

In a host cell, an "exogenous" sequence as used herein means an isolated nucleic acid 
sequence that is introduced into the host cell, using any one of a . variety of introduction 
methods, such' as transfection, 6lectroporation, cationic lipid or salt treatment methods. 

"Knockout mutant" or "knockout mutation" as used herein refers to an in vitro engineered 
disruption of a region of endogenous chromosomal DNA (e.g., disruption of the genome), 

* 

typically within a protein coding region. A knockout mutation can be generated by 
inserting an exogenous DNA sequence into the homologous endogenous sequence; A 
knockout mutation occurring in a protein coding region is expected to disrupt normal 
expression of the protein coding region. This usually leads to loss of the function 
provided by the protein. 

* 

In order that the invention herein described may be more fully understood, the following 
description is set forth. 
30 
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A) MOLECULES OF THE INVENTION 

t 

f 

1.) CEG NUCLEIC ACID MOLECULES ' 

5 The present invention provides isolated and recombinant ceg nucleic acid molecules and 

fragments thereof, and related molecules, such as sequences complementary to ceg 

i * i • 1 

sequences or a portion thereof, and those that hybridize to the nucleic acid molecules of 

* 

' the invention. , ' 1 1 

i t . . 

i 1 " 

10 The ceg polynucleotide sequences, also referred to herein as nucleic acid molecules of the 
invention, are preferably in isolated form, including DNA, RNA, DNA/RNA hybrids, and 
related molecules, and fragments thereof. Specifically contemplated are genomic DNA, 

ribozymes, and ahtiserise molecules, as well as nucleic acid molecules based on an 

i 

alternative backbone or including alternative bases, whether derived from natural sources or 
1 5 synthesized. Embodiments of particular ceg polynucleotide and amino acid sequences 
include, but are not limited to, the sequences described in Tables I and II (e.g., SEQ ID 
NOS:M13, 114-226 and SEQ ID NOS: 227-339, 340-452, respectively). The ceg 
polynucleotide and amino acid sequences were designated cfe which stands for CEG For 
Expression. 

20 

Biological samples of the 2CFE nucleic acid molecules (e.g., SEQ ID NOS: 227-331) 
were deposited on December 20, 2000 with the American Type Culture Collection 
(ATCC), 10801. University Blvd., Manassas, VA 20 1 1 0-2209. 



TABLE I 



CFE Designation 


SEQ. ID NO. 


SEQ. D3 NO. 


POLARITY 




• (Nucleotide) 


(Polypeptide) 




CFE 1 


1 


114 


+ 


CFE 2 


2 


115 




CFE 3 


3 


116 




CFE 4 


4 


117 


+ 


CFE 5 


5 
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119 

* 
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i > 



CFE Designation SEQ.IDNO. SEQ.IDNO..'. POLARITY 

(Nucleotide) (Polypeptide), , , 1 ' 
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TABLE I 



CFE Designation SEQ. ID NO. SEQ*ID,N6. POLARITY, 

(Nucleotide) (Polypeptide) 
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TABLE I 



CFE Designation i SEQ. ID NO. SEQ. ID NO. POLARITY 

(Nucleotide) (Polypeptide) 
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CFE Designation 


SEQ.IDNO. , 

(Nucleotide) 


SEQ. ID NO. 
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a) Variant ceg Nucleotide Sequences 

i , , ■ • - i 

The present invention also provides nucleic acid molecules having a nucleotide sequence 

5 substantially identical or similar to the ceg sequences (SEQ ID NOS: 1-113, 227-331) 

disclosed herein. 

* ' r 

i i — 

■ The present invention provides nucleotide sequences which are similar to SEQ ID. 

j - 

NOS:l 7 H3 and/or SEQ ID NOS:227~331. . The present invention provides nucleotide 
10 . sequences which vary from SEQ ID NOS: 1-1 13 or 227-331 by a range of abput 1% to 
about 70%. 

The present invention encompasses variations in polynucleotide sequences resulting from 
mutations and/or from transfer of genetic material from one cell to another (e.g., 
1 5 horizontal gene transfer or horizontal gene exchange). 

F 

The present invention also provides for variants of the polynucleotide ceg sequences 
disclosed herein, including variants isolated from naturally-occurring sources, those 
generated by recombinant DNA technology or other in vitro synthesis methodologies 
20 (e.g., PCR). The variant polynucleotide sequences of the invention encode polypeptides 
that exhibit the biological activity of naturally-occurring CEG polypeptides, such as 
activity required for bacterial cell viability. 

In general, for example, a variant of ceg polynucleotide sequences may encode a 
25 polypeptide that differs by one or more amino acid substitutions. The variant may have 
conservative changes, wherein a substituted amino acid has similar structural or chemical 
properties, eg, replacement of leucine with isoleucine. 

* 

A polynucleotide sequence can encode conservative amino acid substitutions without 
30 altering either the conformation or the function of the polypeptide. Such changes include 
substituting any of isoleucine (I), valine (V), and leucine (L) for any other of these 

29 
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r' 



hydrophobic amino acids; aspartic acid (D) for glutamic acid (E) and vice versa; 
glutamine (Q) for asparagine (N) and vice versa; and serine (S) for threonine (T) and vice 
versa. Other substitutions can also be considered conservative, depending • on the 
environment of the particular amino acid and its role in the three-dimensional structure of 
5 the protein. For example, glycine (G) and alanine (A) can frequently be interchangeable, 
as can alanine (A) and valine (V). Methionine (M), which is relatively hydrophobic, can 

frequently be interchanged with leucine and isoleucine,. and sometimes with valine. 

1 ■ 1 . , i ■ t 1 

Lysine (K) and arginine (R) are frequently interchangeable in locations in which the 
significant feature of the amino acid residue is its charge and the differing pK's of these 
10 two amino acid residues are not significant. Still other changes can be considered 
"conservative" in particular environments. 

i > 

1 

A variant may also have nonconservative changes, eg, replacement of a glycine with a 
tryptophan. Other variations may also* include amino acid deletions or insertions, or both. 
15 Guidance in determining which and how many amino acid residues may be substituted, 
inserted or deleted without abolishing biological or immunological activity may be found 
using computer programs well known in the art, for example, DNASTAR software. 

Another type of ceg sequence variant includes naturally-occurring allelic variants of ceg 
20 which share significant similarity (e.g., between about -30- 99%) to the disclosed CEG 
polypeptide sequence. Allelic variants of the ceg sequences can encode conservative or 
non-conservative amino acid substitutions of the CEG polypeptide sequence herein 
described. 



25 An example of , allelic variants of ceg are mutant alleles of ceg polynucleotide sequences that 
encode a polypeptide having one or more changes in the polypeptide sequence, such as 
amino acid substitutions, deletions, insertions, frame shifts, or truncations. The mutant 
alleles of ceg may or may not encode a CEG polypeptide having the same biological 
functions as wild-type CEG proteins. 

30 
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Variations in the bacterial genomic sequences can also arise from transfer of genetic 
material to another bacterial cell. The transfer of gene sequences can occur intraspecies 

or interspecies. Gene transfer can occur between bacterial cells which are members of 

1 t ■ 

the same or different populations. A population includes, but is not limited to, a serotype 

i 1 

5 isolate, a clinical isolate, a naturally-occurring isolate, a strain, and a species. The 

transfer of genetic material can occur between cells witliin a population for example 

. .I .. . . ■ ■ i 

transfer between serotype A to serotype A; or between S. pneumoniae and S. 

pneumoniae,) The transfer of genetic material can occur between cells of different 

i i i , 

populations; for example, between serotype A to serotype B or S. pneumoniae and S. 
10 mutans. 1 • , 

i 

Gene transfer can give rise to mutant or polymorphic variant genes sequences. Li rare 
cases, gene transfer introduces new gene sequences that confer a new phenotype, such as 

antibiotic resistance. The transfer of genetic material includes transfer of large regions of 

i 

1 5 genomic sequences which include partial gene sequences, whole single gene sequences, 
or multiple gene sequences. This mode of transfer can give rise to replacement of native 
whole gene sequences or introduction of new sequences in the recipient cell. This mode 
of transfer gives rise to mosaic gene sequences in the recipient cell. 

20 The variation of genomic sequences resulting from gene transfer can be examined using 
molecular techniques, including: multilocus enzyme electrophoresis (Selander. R. K., et 
al., 1986 Appl Environ. Microbiol 51:837-884); and restriction endonuclease cleavage 
electrophoretic profiling (Coffey, T. J., et al., 1991 Mol Microbio. 5:2255-2260); pulse- 
field gel electrophoresis fingerprinting (Bygraves, J. A. and Maiden, M. C. J. 1992 J. 

25 Gen. Microbiol 138:523-531); and ribotyping (Stull, T. L., et al., 1988 J. Infect Dis. 
157:280-286). The degree of variation can vary greatly, and ranges from little or no 
variation as exemplified by gene sequences of E. coli (Caugant, d. A. s et al., 1981 
Genetics 98:467-490; Whittam, T. S., et al., 1983 Mol. Biol. Evol. 1:67-83; Souza, V., et 
al., 1992 Proc. Natl. Acad. Sci. USA 89:8389-8393) and Salmonella (Selander, R. K., et 

30 al., 1990 Infect. Immun. 58:2262-2275; Selander, R.K. and Smith, N. H.1990 Rev. Med 
Microbiol. 1:219-228; Smith, J. M„ et al., 1993 Proc. Natl. Acad. Sci. USA 90:4384- 
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4388), to extensive gene transfer in. Neisseria gonorrhoeae (Smith, J. M.,'et al., 1993 

Proc. Natl Acad. Set USA 90:4384-4388). 

1 ! • • • . • ■ 

i * * 

■ . ■ 1 ' 1 ' I , 

Gene transfer can be examined between various isolates of a particular microbial species 
5 which are antibiotic-sensitive or antibiotic-resistent (Coffey, T. J., et al., 1991 Molec. 
Microbiol 5:2255-2260). Molecular biology techniques can be utilized to study the 
degree of transfer between'populations, such as. for example, the degree of gene transfer 
betwefen serotypes, isolates, strains, or species '. The degree of transfer can be examined 
by comparing, for example, the penicillin binding proteins and numerous different loci 
10 which encode metabolic enzymes or capsular biosynthesis enzymes. 

• i 

! ' ' 

I I 

I 

For example, intra-species, inter-serotype, gene transfer is possible (Coffey, T. J., et al., 
1991 supra). Additionally, intraspecies gene transfer in S. pneumoniae (Coffey, T. J.,,et 
al., 1998 Mol Microbiol. 27:73-83), Vibrio cholerae (Bik, E. M., et al., 1995 EMBO J. 
15 14:209-216), and Haemophilus influenzae (Kroll, J. S. and Moxon, E. R. 1990 J. 
Bacteriol. 1 72 :' 1 3 74- 1 3 79) are possible. . 

Interspecies gene transfer is also possible (Dowson, C. G., et al., 1989 Proc. Natl. Acad. 
Set USA 86:8842-8846; Laibl, G., et al., 1991 Mol. Microbiol. 5:1993-2002; Bourgoin, 
20 F.,etal., 1999 Gene 233:151-161). 

Variant gene sequences arising from gene transfer can be continually generated in 
transformable bacteria (e.g., transformation competent), such as S. pneumoniae. For 
example, the worldwide spread of varying degrees of antibiotic resistance has. been 

25 documented and reviewed (Dowson, C. G., et al., 1994 Trends Microbiol 2:361-366; 
Spratt, B. G. in Bacterial Cell Wall, eds Ghuysen J-M. and Hakenbeck, R. 1994 pp. 517- 
534; and reviewed in Maiden, M. C. J. 1998 Clinic. Infect Dis. 27 (Supplement 1) S12- 
S20). For example, variant gene sequence arising from gene transfer can be tracked 
using a marker gene such as the gene which encodes the penicillin binding protein 

30 (Barcus, V. A., et al., 1995 FEMS Microbiol Lett 126:299-303). At the nucleotide level, 
gene sequences encoding the penicillin binding proteins in susceptible and resistant 
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strains differ by about 14% to 23% (Hakenbeck, R. 1995 Biochem. Pharmacol. 50:1121- 
1 127; Spratt, B. G. in Bacterial Cell Wall, eds Ghuysen J-M. and Hakenbeck, R. 1994 pp. 

i 517-534; Spratt, B..G., et al., 1991 Neisseria meningitidis arid Streptococcus pneumonia^ 
eds. Camisi, J., et al., pp. 73-83; Coffey, T. J., et al., 1995 Micro. Drug Resist. 1:29-34). 
5 • • 

The ceg nucleotide sequences can be isolated from various, species of {Streptococcus 
including Streptococcus pneumoniae. • Additionally, the ceg sequences can be isolated from 
other Steptococcal species, including 5- mutans, S. pyogenes, and 51 thermophila, The ceg* 
polynucleotide sequences can also lie isolated from strains of otheir bacterial genera 
1 0 including, but not limited to, Streptococcus, ■ Escherichia, Bacillus, Pse^uiidmonas, 
Yersinia, Salmonella, and Haemophilus. 

! , 

■ i 

1 

The present invention additionally provides isolated codon-usage variants that differ from 
the disclosed ceg nucleotide sequences, yet do not alter the predicted CEG polypeptide 
15 sequence or function. The codon-usage variants may be generated by recombinant DNA 

i 

technology. Codons may be selected to optimize the level of production of the ceg 
transcript or CEG polypeptide in a particular prokaryotic or eukaryotic expression host, 
in accordance with the frequency of codon utilized by the host cell. Alternative reasons 
for altering the nucleotide sequence encoding a CEG polypeptide include the production 
20 of RNA transcripts having more desirable properties, such as an extended half-life or 

• * 

increased stability. A multitude of variant ceg nucleotide sequences that encode the 
respective CEG polypeptide may be isolated, as a result of the degeneracy of the genetic 
code. Accordingly, the present invention contemplates selecting every possible triplet 
codon to generate every possible combination of nucleotide sequences that encode the 
25 disclosed CEG polypeptides. This particular embodiment provides isolated nucleotide 
sequences that vary from the sequences as described in SEQ ID NOs.: 1-113 or 227-331, 
such that each variant nucleotide sequence encodes a polypeptide having sequence 
identity with the amino acid sequences, as described in SEQ ID NOs.: 114-226 or 332- 
436, respectively. 

30 
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i * • 

b) Complementary Sequences , 

■I , . ./ ■ . .... 

I ' I. • ' | 1 

1 I 

The present invention includes polynucleotide sequences that are complementary to the 

5 sequences disclosed herein. The term "complementary 11 as used herein refers to the 

capacity of purine and/qr pyrimidine nucleotides to associate through hydrogen bonding 

to form double stranded nucleic acid molecules. The following base pairs are related by 

complementarity: guanine and cytosine; adenine and thymine; and adenine and uracil. 

* i* 
Complementary applies to all base pairs comprising at least two single-stranded nucleic 

i 1 

10 acid molecules. 

1 

i ■ 

c) Sequences Capable of Hybridizing ., 

i 

Another embodiment provides nucleic acid molecules that will hybridize to ceg 
15 • sequences under hybridization conditions. It is readily apparent to one skilled in the art 
that the stringency of the hybridization condition selected will depend upon the 
characteristics of the nucleic acid molecule to be hybridized, such as, the length, the 
degree of complementarity (e.g., exact or non-exact complementarity), the percent A/T 
content, and the objective of the hybridization experiment. 

20 

The hybridization procedure may by performed in low stringency hybridization 
conditions. Low stringency hybridization conditions will permit hybridization between 
two nucleic acid molecules that differ from exact complementarity by about 25% to 70%. 
Hybridization under standard high stringency conditions will occur between two 
25 complementary nucleic acid molecules (e.g., 100% exact complementarity) or two 
complementary nucleic acid molecules that differ from exact complementarity by about 
1% to about 70%. 

The high stringency hybridization conditions that disfavor non-homologous base pairing 
30 are well known in the art. Typically, high stringency hybridization conditions, includes 
but is not limited to, hybridizing at 50 °C to 65 °C in 5X SSPE, and washing at 50 °C to 

34 . 
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65 °C in 0.5X SSPE. Typically, low stringency conditions, includes but is not limited to, 
hybridizing at 35 °C to 37 °C in 5X SSPE and 40% to 45% formamide and washing at 42 
°C in 1-2X SSPE. The conditions arid formulas for high stringency hybridization 
methods are well known in the art and can be readily obtained in Molecular ' Cloning; A 
Laboratory Manual (2 nd edition, Sambrook, Fritch, and Maniatis 1989, Cold Spring 
Harbor Press) or in Short Protocols in Molecular Biology (Ausubel, F. M., et al., 1989, 
John Wiley & Sons), 



d) Fragments of ceg Sequences 



i.' i 



The invention further provides nucleic acid molecules having fragments of the ceg 
sequences, such as a portion of the ceg sequence (e.g., SEQ ID NOS:l-113, 227-331) 
disclosed herein. The size of the fragment will be determined by its intended use. For 
example, the length of the fragment to be used as a nucleic acid probe or PCR primer is 
15 chosen to obtain a relatively small number of false positives during probing or priming. 
Alternatively, a fragment of the ceg sequence may be used to construct a recombinant fusion 
gene having a ceg sequence fused to a non-ceg sequence. 

The nucleic acid molecules, fragments thereof, and probes and primers of the present 
20 invention are useful for a variety of molecular biology techniques including, for example, 
hybridization screens of libraries, or detection and quantification of mRNA transcripts as 
a means for analysis of gene transcription and/or expression. Preferably, the probes and 
primers are DNA. A probe or primer length of at least 15 base pairs is suggested by 
theoretical and practical considerations (Wallace, B. and Miyada,. G. 1987 
25 "Oligonucleotide Probes for the Screening of Recombinant DNA Libraries" in: Methods ■ 
in Enzymology, 152:432-442, Academic Press). Other lengths of fragments, probes, or 
primers are possible and routine to determine. 

- The probes and primers of this invention can be prepared by methods well known to 
30 those skilled in the art (Sambrook, et' al. supra). In a preferred .embodiment the probes 
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and primers are synthesized by chemical synthesis methods (ed: Gait, M. J. 1984 
Oligonucleotide Synthesis, IRL Press, Oxford, England). 

* 1 . 1 i 

. • i . 

One embodiment of the present invention provides nucleic acid primers that are 
complementary to ceg sequences, which allow the specific amplification of nucleic acid 
molecules of the invention or of any specific parts t thereof. Another* embodiment 
provides nucleic acid probes that are complementary for selectively or spepifically 
hybridizing to the ceg sequences or to any part thereof. 



10 e) Derivative Nucleic! Acid Molecules 



i.' < 



The nucleic acid molecules of the invention include peptide nucleic acids (PNAs), or 

derivative molecules such as phosphorothioate, phosphotri'ester, phosphoramidate, and 

t 

methylphosphonate, that specifically bind to single-stranded DNA or RNA in a base pair- 
15 dependent manner (Zamecnik, P. C, et al., 1978 Proc. Natl. Acad. Set 75:280284; 
Goodchild, P. C, et al., 1986 Proc. Natl. Acad Sci. 83:4143-4146). 

♦ 

PNA molecules comprise a nucleic acid oligomer to which an amino acid residue, such as 
lysine, and an amino group have been added. These small molecules, also designated 

20 anti-gene agents, stop transcript elongation by binding to their complementary (template) 
strand of nucleic acid (Nielsen, P. E., et al., 1993 Anticancer Drug Des 8:53-63). For 
example, reviews of methods for synthesis of DNA, RNA, and their analogues can be 
found in-: Oligonucleotides and Analogues, eds. F. Eckstein, 1991, IRL Press, New York; 
Oligonucleotide Synthesis, ed. M. J. Gait, 1984, IRL Press, Oxford, England. 

25 Additionally, methods for antisense RNA technology are described in U. S. patents 
5,194,428 and 5,1 10,802. A skilled artisan can readily obtain these classes of nucleic acid 
molecules using the herein described ceg polynucleotide sequences, see for example 
Innovative and Perspectives in Solid Phase Synthesis (1992) Egholm, et al. pp 325-328 or 
U. S. Patent No. 5,539,082. 

30 
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f) RNA Molecules 

The present invention provides RNA molecules that encode ■ the predicted ceg gene 

i ■ ! 

5 products. In particular, the RNA molecules of the invention may be isolated full-length 
or partial mRNA molecules or RNA oligomers that encode CEG gene products. The 
RNA molecules of the invention include the ' nucleotide sequences encoding all or 
portions of CEGs. ( 1 ' . 

r 

10 The RNA molecules of the invention also include antisense RNA molecules, peptide 
nucleic acids (PNAs), or non-nucleic acid molecules such as phosphorothioate 
derivatives, that specifically bind to the. sense strand of DNA or RNA in a base pair- 
dependent manner. A skilled artisan can readily obtain these classes of nucleic acid 
molecules using the herein described ceg sequences. 

15 '• ' . 

t 

g) Labeled Nucleic Acid Molecules 

The nucleic acid molecules having ceg sequences can be labeled with a detectable 
marker. Examples of a detectable marker include, but are not limited to, a radioisotope, a 
20 ' fluorescent compound, a bioluminescent compound, a cherniluminescent compound, a 
metal chelator or an enzyme. Technologies for generating labeled DNA and RNA probes 
are well known in the art (See e.g. Sambrook et aL, supra). 

2.) RECOMBINANT NUCLEIC ACID MOLECULES 

25 

Also provided are recombinant nucleic .acid molecules, such as recombinant DNA molecules 
(rDNAs) that comprise ceg sequences or fragments thereof. As used herein, a recombinant 
DNA molecule is a DNA molecule that has been subjected to molecular manipulation in vitro. 
Methods for generating rDNA molecules are well known in the art, for example, see Sambrook 
30 et al., Molecular Cloning (1 989), supra. 
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a) Vectors 

The nucleic acid molecules of the invention may be recombinant molecules each 
comprising the sequence, or portions thereof, of a ceg sequence linked' to, a non-ceg 
5 sequence. For example, the ceg sequence may be fused, operatively to a vector to 
generate a recombinant molecule. The term vector includes, but is not limited to, 
plasmids, cosmids, and phagemids. A. preferred vector includes an autonomously 
replicating vdctor comprising a replicon that directs the replication of the rDNA within the 
appropriate host cell. The preferred vectors can also* include an expression control 

10 element, such as a promoter sequence, which enables transcription of the inserted ceg 
sequences and can be used for regulating the expression (e.g., transcription and/or 
translation), of an, operably linked ceg sequence in an appropriate host cell such as 
Escherichia colt Expression control elements are known in the art and include, but are not 
limited to, inducible promoters, constitutive promoters, secretion signals, enhancers, 

15 transcription terminators, and other transcriptional regulatory elements. Other expression 
control elements that are involved in translation are known in the' art, and include the Shine- 
Dalgarno sequence, and initiation and termination codons. The preferred vector also 
includes at least one selectable marker gene that encodes a gene product that confers drug 
resistance such as resistance to ampicillin or tetracyline. The vector also comprises 

20 multiple endonuclease restriction sites that enable convenient insertion of .exogenous 
DNA sequences. 

The preferred vectors for generating ceg transcripts and/or the encoded CEG polypeptides 
are expression vectors which are compatible with prokaryotic host cells. Prokaryotic cell 
25 expression vectors are well known in the art and are available from several commercial 
sources. For example, a pET vectors (e.g., pET-21, Novagen Corp.) may be used to 
express CEG polypeptides in bacterial host cells. 
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b) Recombinant Vectors for Integration 

I , ' ' ' ' I 

, 1 - 1 

l 

The present invention provides recombinant vectors that may; be used to integrate 
5 exogenously provided sequences into the genome of a host cell. The recombinant 
integration vectors of the present invention include a gene ;that encodes, a selectable 
marker and ceg sequences; or fragments thereof. The integration vectors are used to 
integrate the ceg sequence into a target gene sequence that resides within the bacterial 1 
host genome (e.g., endogenous sequence), thereby disrupting the function of the target 
10 gene sequence within the bacterial cells. These integration vectors may be used iti a gene 
disruption assay to screen candidate ceg nucleotide sequences, in order to identify the 
candidate sequences that encode a gene product that is required for bacterial cell viability, 

I i 

( i | 

Accordingly, these recombinant integration vectors include candidate ceg sequences that 

i • * 

15 ' will be screened to determine if the candidate ceg sequences encode a gene product that 
is required for cell viability. The candidate ceg sequence that is included as part of the 
recombinant integration vector is the "exogenous" ceg sequence that is employed as the 
"disrupting" sequence in a gene disruption assay. The ceg sequence that resides within 
the host genome is the "endogenous" or target" ceg sequence. 

20 

The integration event rarely occurs, for example, by non-homologous recombination in 
which a recombinant vector, that includes the exogenous ceg sequence, inserts the 
exogenous ceg sequence into a random location within the host genome. In a more 
preferred embodiment, the integration event inserts the exogenous ceg sequence into a 

25 specific target site within the host genome. The targeted integration event can involve 
homologous recombination in which the integration vector, that includes the exogenous 
ceg sequence, inserts the exogenous ceg sequence into its homologous target ceg 
sequence that resides within the host's genome (e.g., the endogenous ceg sequence) 
(Figure 1). Further, the exogenous ceg sequence can be used as a disrupting sequence 

30 whereby the homologous recombination event integrates the exogenous ceg sequence 
into the endogenous target ceg sequence resulting in disruption of the function of the 
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25 



endogenous ceg sequence. For example, disrupting the function of the endogenous ceg 

sequence may result in the loss of bacterial cell viability. 

i . . • 1 i 

. . . . , ■ • 

i 

An example of a recombinant vector, that can be used as ari integration vector in S. 

5 pneumoniae is the pEVP-3 vector (Jean-Pierre Claverys, et al. 1995 Gene 164: 123-128). 
* : i 

The pEVP-3 vector integrates an exogenous sequence by homologous recombination 

■ i t 

involving a Campbell-type event (S. Adhya and A. Campbell 1970 J. Mol Biol 50:481- 

490). The pEVP-3 vector includes a replicon that functions only in gram-negative 

i *' * - I 

bacteria, such as E. coll Therefore, the pEVP-3 vector cannot replicate in S. 

i ,.• , 

10 . pneumoniae. This vector also contains multiple cloning sites, and confers resistance to 
chloramphenicol in both a gram-negative and gram-positive bacteria, such as S. 
pneumoniae. 



c) Fusion Gene Sequences 

i 

I 

A fusion ceg gene is another example of a recombinant molecule of the invention. A fusion 
gene includes a ceg sequence operatively fused (e.g., linked) to a non-ceg sequence such as, 
for example, a tag sequence to facilitate isolation and/or purification of the expressed 
CEG gene product (Kroll, D.J., et al., 1993 DNA Cell Biol 12:441-53). 

Alternatively, a recombinant fusion molecule has a ceg sequence of the invention fused to 
a ceg sequence isolated from a different microbial source. For example, the disclosed ceg 
sequences isolated from £ pneumoniae can be fused to a ceg sequence isolated from a 
different bacterial species. 

3.) CEG PROTEINS AND POLYPEPTIDE MOLECULES 



The invention additionally provides CEG proteins and peptide fragments thereof that are 
isolated or substantially purified. Embodiments of particular CEG amino acid sequences 
30 are disclosed in Tables I and H (SEQ ID NOS:l 14-226 and SEQ ID NOS:332-436, 
respectively). 
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The present invention also includes polypeptides having sequence variations from the 

i 

predicted CEG polypeptide sequences disclosed hereinj including mutant variants, 
conservative substitution variants, and similar CEG polypeptides from dther prokaryotic 
5 organisms. For convenience, such proteins are referred to herein as "CEG proteins", 

* * 

"CEG polypeptides", or "proteins of the invention". . . ' . . 

if . . ■ ■ i 

As used herein, CEG protein refers to a polypeptide having amino acid sequence identity or 

*■ • 

similarity to any one of the predicted amino acid sequences, as provided in SEQ ID NO.: 

t 

10 114-226 or 332-436. The variant CEG polypeptides can be allelic forms of CEG, such as 
mutant forms of CEG polypeptides. The present invention also provides conservative 
substitution-mutants of the CEG proteins that maintain functional activity of wild-type CEG 
(e.g., the CEG polypeptide is required for bacterial cell viability). 

! 

15 The CEG protein may be isolated from any source whether natural, synthetic, semi- 

i 

synthetic, or recombinant. As used herein, "natural" refers to a polypeptide which is 
found in nature. Accordingly, the CEG proteins may be isolated from a prokaryotic 
organism, such as a bacterial strain including, but not limited to, Streptococcus, 
Escherichia, Bacillus, Pseudomonas, Yersinia, Salmonella, and Streptomyces. The CEG 
20 proteins of the invention, and fragments thereof, can also be generated by recombinant 
methods or chemical synthesis methods. 

The CEG polypeptides of the invention are essential for the viability of a bacterial cell. 
Further, the CEG polypeptides can exhibit at least any one of the following functions: a 
25 pantothenate kinase, a Holliday Junction branch migration protein, a single stranded • 

* 

DNA binding protein, a phosphoglucosamine mutase, an acetyltransferase, an 
uridylyltrarisf erase, a malonyl CoenzymeA:ACP transcylase, a 3-oxoacyl-ACP synthase 
II, a 3-oxoacyl-ACP reductase, a phosphomethylpyrimidine (HMP-P) kinase, a GTP 
binding protein, a ATP binding protein, or a 4-aminoimidazole carboxylase. Putative 
30 functions can include, but are not limited to, sugar transferase, techoic acid biosynthesis, 
ribosome recycling factor, response regulator, nicotinate phosphoribosyltransferase, 
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nitropropane dioxygenase, (3R)-hydroxymyristol acyl carrier protein dehydrase, sugar 

dehydrogenase, murein biosynthesis, cobalimin biosynthesis, ABC transporter, tRNA 

1 modification enzyme, arylsulfatase, 16S processing enzyme, tRNA methyl transferase, 

elongation factor P, signal recognition particle, protein export, undecaprpnol kinase, SRP 

5 docking domain, diacyl glycerol kinase, dihydopicilinate reductase, HU-DNA binding 

protein, thiamine biosynthase, GreA transcription elongation factor, dTDP-L-rhamnose 

synthase, ATP-binding motif, ribose-5-p-3-epimerase-like activity, » GTP 

1 • • • 1 1 1 i , 

pyrophosphokinase', acetyl-CoA carboxylase, O-sialoglycoprotein endopeptidase, 

glucosamine-fructose-6-phosphase aminotransferase, Strpn adhesion-associated ABC- 

10 permease, GTP pyrophosphokinase RelA, IMP dehydrogenase, DNA gyrase subunit B, 

acetyl-CoA carboxylase subunit AccD, phosphoglycerol kinase, acetyl-CoA carboxylase 

carbonyl transferase,, phosphopanthetheine adenylyltransferase, oligopeptide transport 

* » i * 

permease subunit, translocation protein, perM permease, DNA pol HI gamma and tau 

- i * 

subunits, DNA pol III delta subunit,; signal peptidase I, acetyl-coA carboxylase biotin 
15 carboxyl carrier protein, protein chain release factor- 1, replicative DNA helicase, 
topoisomerase, pentapeptide-transferase, elongation factor G, spore coat polysaccharide 
biosynthesis protein C, protein release factor B, DNA polymerase III alpha subunit, 
phosphoprotein phosphatase, chaparonin, UDP-N-acetylmuramoylalanyl-D-glutamate-2, 
6-diaminopimelate ligase, techuronic acid biosynthesis, UDP-glupose lipid carrier 
20 transferase, transcription termination factor, chromosome segregation factor, amino acid 
biosynthesis, HMG-CoA reductase, hypoxanthine-guanine phosphoribosyltransferase. 

a) MODULATORS OF CEG POLYPEPTIDES 

25 The invention provides compounds that modulate (e.g., activate or inhibit) the function of 
a CEG polypeptide. Such compounds can provide lead-compounds for developing drugs 
for diagnosing and/or treating conditions associated with bacterial infections. The 
modulator is a* compound that may alter the function of the CEG polypeptide, such as 
activating or inhibiting the function of a CEG polypeptide. For example, the compound 

30 can act as agonist, antagonist, partial agonist, partial antagonist, cytotoxic agents, 
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inhibitors of cell proliferation, and cell proliferation-promoting agents. The activity of 
the compound may be known, unknown or partially known. 

Suitable ligands include, but are not limited to, diazalactones,. J\T-protected amino acid, 
azabicyclodiene, and alkaloids. 

1 i 
• -I 
An example of a diazalacto'ne is: 




An example of a .W-protected amino acid is: ' 




O- 

10 

An example of an azabicyclodiene is: 
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Examples of alkaloids are: ■ , , * 




i 



5 B) METHODS FOR MAKING THE CEG PROTEINS AND POLYPEPTIDES 

Recombinant methods are preferred if a high yield is desired. Recombinant , methods 
involve expressing the cloned gene in a suitable host cell. For example, a host cell is 
introduced with an expression vector having the CEG sequence, then the host cell is 
10 cultured under conditions that permit in vivo production of the CEG protein. The 
recombinant vector can integrate the CEG sequence into the host genome. Alternatively, 
the CEG sequence can be maintained extra-chromosomally, as part of ah autonomously 
replicating vector. 

15 1. HOST-VECTOR SYSTEMS 

The invention further provides a host-vector system comprising the vector, plasmid, 
phagemid, or cosmid comprising a ceg nucleotide sequence, or a fragment thereof, 
introduced into a suitable host cell. The host-vector system can be used to produce the 
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CEG polypeptides encoded by the ceg nucleotide sequences. The host , cell can be 
prokaryotic or eukaryotic. Examples of suitable prokaryotic host cells include bacteria 
■ strains from genera such as Escherichia, Bacillus, Pseudomdnas, Streptococcus, 'and 
Streptomyces. Examples of suitable eukaryotic host cells include a yeast cell, a plant cell, 
5 or an animal cell, such as a mammalian cell. A preferred embodiment provides a host- 
vector system comprising the pET21 vector having a ceg sequence introduced into an E. 
coli A,DE3 lysogen which is useful, for example for the production of the CE|G protein, 
herein designated CFE polypeptides and CFE proteins. 

• i 

1 0 Introduction of the rDNA molecules of the present invention into an appropriate cell host is 
accomplished by well known methods that typically depend on the type of vector used and 
host system employed. For example, transformation of prokaryotic host cells by 
electroporation and salt treatment methods are typically employed, see for example, Cohen 
et al, 1972 ProcAcadSci USA 69:2110; Maniatis, T., et al., 1989 Molecular Cloning, A 

i 

15 Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY. 
Transformation of vertebrate cells with vectors containing rDNAs, electroporation, cationic 
lipid or salt treatment methods are typically employed, see, for example, Graham et al, 
1973 Virol 52:456; Wigler et al., 1979 Proc Natl Acad Sci USA 76:1373-76. 

* 

20 Successfully transformed cells, i.e., cells that contain a rDNA molecule of the present 
invention, can be identified by well known techniques. For example, cells resulting from 
the introduction of a rDNA of the present invention can be selected and cloned to produce 
single colonies. Cells from those colonies can be harvested, lysed and their DNA content 
examined for the presence of the rDNA using a method such as that described by Southern, 

25 JMol Biol (1975) 98:503, or Berent et al., Biotech (1985) 3:208, or the proteins produced 
from the cell assayed via a biochemical assay or immunological method. 

Procaryotes are generally used as host cells for cloning and producing the products of 
exogenous DNA sequences. For example, the Escherichia coli K12 BL21 (AJDE3) 
30 (Novagen) is particularly useful for expression of foreign proteins. Other strains of E. 
colU and bacilli such as Bacillus subtilis, Enterobacteriaceae such as Salmonella 
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typhimurium or Serratia marcescans, various Pseudomonas, StreptocQccus, and 
Streptomyces species may also be employed as host cells in cloning and expressing the 
1 recombinant proteins of this invention. ' 

* ■ 

5 In general terms, the production of , recombinant CEG proteins may involve using a 
host/vector system/ or other methods may be used. The host/vector system may employ the 

following steps. 

■ ■< i 

i 1 1 i ' 

A nucleic acid molecule is obtained that encodes a CEG protein or a fragment thereof, such 

10 as any one of the polynucleotides disclosed in SEQ ID NOs.: 1-1 13 or 227-33 1. The CEG- 
encoding nucleic acid molecule is preferably inserted into an expression vector in operable 
linkage with suitable expression control sequences, to generate an expression vector 
including the CEG-encoding sequence. The expression vector is introduced into a suitable 
host, by standard transformation methods, and the resulting transformed host is cultured 

15 under conditions that allow the production of the CEG protein. For example, if expression 
of the CEG gene is, under the control of an inducible promoter, then suitable growth 
conditions would include the appropriate inducer. The CEG protein (e.g., designated a 
CFE polypeptide or protein), so produced, is isolated from the growth medium or directly 
from the cells; recovery and purification of the protein may not be necessary in some 

20 instances where some impurities may be tolerated. A skilled artisan can readily adapt an 

< 

appropriate host/expression system known in the art for use with CEG-encoding sequences 
to produce a CEG protein (Cohen, et al. , supra; Maniatis et al., supra). 

Host cells harboring the nucleic acids disclosed herein are also provided by the present 
25 invention. A preferred host is E. coli strain BL21(^DE3) transfected or transformed with 
a vector comprising a nucleic acid of the present invention. The invention also provides a 
host cell capable of expressing the ceg sequences described herein. The preferred host 
cell is any strain of E. coli that can accommodate high level expression of an exogenously 
introduced gene. 

30 
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The proteins of the present invention can also be made by chemical synthesis. The 
principles of solid phase chemical synthesis of polypeptides are well known in the art and 
may be found in general texts relating tp this area (Dugas, H. and Penney, C. 1981 
Bioorganic Chemistry, pp 54-92, Springer-Verlag, New York). CEG polypeptides may 
5 be synthesized by solid-phase methodology utilizing an Applied Biosystems 43 OA 
peptide synthesizer (Applied Biosystems, , Foster City; Calif.) and synthesis cycles' 
supplied by Applied Biosystems. Protected amino acids, such as t-butoxycarbonyl- 
protected amino acids, and other reagents are commercially available from many 
chemical supply houses. 1 ( , 

10 1 . 

The polypeptides of the invention exhibit properties of a CEG protein, such as for 
example, the ability to elicit the generation of antibodies that specifically bind an epitope 
associated with CEG polypeptides. Accordingly, the CEG polypeptide, or any 
oligopeptide thereof, is capable of inducing a specific immune response in appropriate 
15 animals or cells and binding with specific antibodies. 

i 

O ANTIBODIES THAT RECOGNIZE AND BIND THE PROTEINS AND 
POLYPEPTIDES OF THE INVENTION ~~ 

20 The invention further provides antibodies (e.g., polyclonal, monoclonal, chimeric, 
humanized, and human antibodies) that bind a CEG polypeptide. The most preferred 
antibodies will selectively bind a CEG polypeptide and will not bind (or will bind weakly) a 
non-CEG polypeptide. Antibodies that are particularly' contemplated include monoclonal 
and polyclonal antibodies, as well as fragments thereof (e.g., recombinant proteins) which 

+ 

25 include the antigen binding domain and/or one or more complement determining regions of 
these antibodies. These antibodies can be from any source, for example, rabbit, sheep, rat, 
dog, cat, pig, horse, mouse, and human. 

The invention encompasses antibody fragments that specifically recognize a CEG 
30 polypeptide. As used herein, an antibody fragment is defined as at least a portion of the 
variable region of the immunoglobulin molecule that binds to its target, i.e., the antigen 
binding region. Some of the constant region of the immunoglobulin may be included, 
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) i 1 ' * 

As will be understood by those skilled in the art, the regions or epitopes of a CEG 

polypeptide to which an antibody is directed may vary \yith the intended application. For 

• ■ i 

example, antibodies intended for use in an immunoassay for the detection bf membrane- 
5 bound CEG proteins on viable bacterial cells should be directed to an accessible epitope 
on membranerbound CEG proteins. Antibodies that recognize other epitopes may be 
useful for the identification of CEG protein y/ithin damaged 'or dying cells, for the 

detection of secreted CEG protein or fragments thereof. ■ 

i ■ 

+ * 

10 Various methods for the preparation of antibodies are well known in the art. For example, 
antibodies may be prepared by immunizing a suitable mammalian host using a CEG protein, 
peptide, or fragment, in isolated or immunoconjugated form (Harlpw, 1989 Antibodies ; Cold 
Spring Harbor Press, NY). In addition, fusion proteins comprising CEG polypeptides rnay 
also be used, such as a CEG protein/GST-fusion protein. Cells expressing or overexpressing 

15 a CEG polypeptide may also be used for immunizations. Similarly, any cell engineered to 
express CEG protein may be used. This strategy may result in the production of monoclonal 
antibodies with enhanced capacities for recognizing endogenous CEG protein. 

■ * * 

The present invention contemplates chimeric antibodies that comprise a human and non- 
20 human immunoglobin portion. The antigen combining region (variable region) of a 
chimeric antibody can be derived from a prokaryotic source (e.g., bacteria) and the 
constant region of the chimeric antibody which confers biological effector function to the 
immunoglobulin can be derived from a eukaryotic source (e.g., human). The chimeric 

* 

antibody should have the antigen binding specificity of the prokaiyotic antibody 
25 molecule and the effector function conferred by the eukaryotic antibody molecule. 

In one example, the procedure used to produce chimeric antibodies can involve the 
following steps: 

a) Identifying and cloning the correct immunoglobin gene segment encoding the 
30 antigen binding portion of the antibody molecule. This gene segment is known as 

the VDJ, variable, diversity and joining regions for heavy chains or VJ, variable, 
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joining regions for light chains or simply as the V or variable region. This gene 
regions may be in either the cDNA or genomic form; 
1 t() Cloning the gene segments encoding the constant region or desired parf thereof; 1 1 

c) Ligating the variable region with the constant region sq that the complete chimeric 
5 antibody is encoded in a form that can be transcribed and translated; 

d) Ligating this construct into a vector containing a selectable marker and gene control 
, regions such as promoters, enhancers and poly(A) addition signals; 

e) Amplifying this construct in bacteria; 

I 

' f) Introducing this DNA into eulcaryotic cells (transfection) most often mammalian 
10 lymphocytes; j 1 

g) Selecting for cells expressing the selectable marker; 

h) Screening for cells expressing the desired chimeric antibody; and 

■ k) Testing the antibody for appropriate binding specificity and effector functions. 

* 

i 

■ t 
1 5 Chimeric antibodies of several distinct antigen binding specificities have been produced 

i 

by protocols well known in the art, including anti-TNP antibodies (Boulianne et al., 1984 
Nature 312:643); and anti-tumor antigen antibodies (Sahagan et al., 1986 J. Immunol 
137:1066). Likewise, several different effector functions have been achieved by linking 
new sequences to those encoding the antigen binding region. Examples of these include 

20 enzymes (Neuberger et al., 1984 Nature 312:604); immunoglobulin constant regions 
from another species and constant regions of another immunoglobulin chain (Sharon et 
al., 1984 Nature 309:364; Tan et al., 1985 J. Immunol. 135:3565-3567). Additionally, 
procedures for modifying antibody molecules and for producing chimeric antibody 
molecules using homologous recombination to target gene modification have been 

25 described (Fell et al., 1989 Proc. Natl Acad Set USA 86:8507-851 1). 

The predicted amino acid sequence of a CEG protein may be used to select specific regions 
of the CEG protein for generating antibodies. For example, hydrophobicity and 
hydrophilicity analyses of a CEG polypeptide may be used to identify hydrophobic and 
30 hydrophilic regions in the CEG protein. Regions of the CEG protein that show 
immunogenic structure, as well as other regions and domains, can readily be identified using 
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various other methods known in the art, such as Chou-Fasman, Garnier-Robsori , Kyte- 

i 

Doolittle, Eisenberg, Karplus-Schult or Jameson- Wolf analysis.. Fragments that include the 
immunogenic regions are particularly suited for generating specific classes of antibodies. 

5 Methods for preparing a protein for use as an immunogen and for preparing immunogenic 
conjugates of a protein vyith a carrier such as BSA, KLH, or other carrier proteins are well 
known in the art. In some circumstances, direct, conjugation using, for example, 
carbodiimide 'reagents may be used; in' other instances linking reagents such as those 
supplied by Pierce Chemical Co., Rockford, EL, may be effective. .Administration of a CEG 
10 immunogen is conducted generally by injection over a suitable time period and with use of a 
suitable adjuvant, as is generally understood in the art. During the immunization schedule, 
titers of antibodies can be taken to determine adequacy of polyclonal antibody formation. 

* 

While the polyclonal antisera produced in this way may be satisfactory for some 
15 applications, for pharmaceutical compositions, monoclonal antibody preparations ,are 
preferred. Immortalized cell lines which secrete a desired monoclonal antibody may be 
prepared using the standard method of Kohler and Milstein (Nature 256: 495-497) or other 
techniques as described in Monoclonal Antibodies; A Manual of Techniques, CRC press*. 
Inc., Boca Raton, Fla. (1987) ed. Zola. The immortalized cell lines secreting the desired 
20 antibodies are screened by immunoassay in which the antigen is the CEG polypeptide 
having binding activity, or a fragment thereof. When the appropriate immortalized cell 
culture secreting the desired antibody is identified, the cells can be cultured either in vitro or 
by production in ascites fluid. 

25 The desired monoclonal antibodies are then recovered from the culture supernatant or from 
the ascites supernatant. Fragments of the monoclonal antibodies of the invention or the 
polyclonal antisera (e.g., Fab, F(ab% Fv fragments, fusion proteins) which contain the 
immunologically significant portion (i.e., a portion that recognizes and binds a CEG protein) 
can be used as antagonists, as well as the intact antibodies. Humanized antibodies directed 

30 against a CEG polypeptide are also useful. The advantage of using humanized antibodies is 
that they are less immunogenic in humans. As used herein, a humanized antibody is an 
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immunoglobulin molecule which is capable of binding to a CEG polypeptide and which 

1 . . . - 

comprises a FR region having substantially the amino acid sequence of a human 

immunoglobulin and a CDR having substantially the amino acid , sequence of hon-humati 

immunoglobulin or a sequence engineered to.bind a CEG protein. Methods for humanizing 

murine and other non-human antibodies by substituting one or more of the non-human 

antibody CDRs for corresponding human antibody sequences arp well knowrjL (Jones et al., 

1986 Nature 321: 522-525; Riechmnan et al., 1988 Nature332: 323-327; Verhopyen et al., 

1988 Science 239:1534-1536; Carter et al., 1993 Prod Natl Acad Set USA 89: 4285;' 

and Sims et al., 1993 J. Immunol 151:|2296). 

i 
I 

Use of immunologically reactive fragments, such as the Fab, Fab 1 , or F(ab^2 fragments is 
often preferable, especially in a therapeutic context, as these fragments are generally less 

immunogenic than the whole immunoglobulin. Further, bi-specific antibodies specific for 

i 

two or more epitopes may be generated using methods generally known in the art. Further, 
antibody effector functions may be modified so as to enhance the therapeutic effect of the 
antibodies of the invention. For exampte, cysteine residues may be engineered into the Fc 
region, permitting the formation of interchain disulfide bonds and the generation of 
homodimers which may have enhanced capacities for internalization, ADCC and/or 
complement-mediated cell killing (Caron et al., 1992 J. Exp. Med 176: 1191-1195; 
Shopes, 1992 J. Immunol 148: 2918-2922). Homodimeric antibodies may also be 
generated by .cross-linking techniques known in the art (Wolff et al., Cancer Res. 53: 2560- 
2565). The invention also provides pharmaceutical compositions having the monoclonal 
antibodies or anti-idiotypic monoclonal antibodies of the invention. 

The antibbdies or fragments may also be produced, using current technology, by 

* ■ 

recombinant means. Regions that bind specifically to the desired regions of the CEG 
protein can also be produced in the context of chimeric or CDR* grafted antibodies of 
multiple species origin. The invention includes an antibody, e.g., a monoclonal antibody 
which competitively inhibits the immunospecific binding of any of the monoclonal 
antibodies of the invention to a CEG protein. 

- ^. 
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Alternatively, methods for producing fully human monoclonal antibodies, include phage 
display and transgenic methods, are known and may be used for the generation of human 
1 monoclonal antibodies (reviewed in: Vaughan et al.,' 1998 Nature Biotechnology 16: $35- 
539). For example, fully human monoclonal antibodies may be generated using cloning 
5 technologies employing large human Ig gene combinatorial libraries (i.e., phage display) 
(Griffiths and Hoogenboom, "Building an in vitro immune system: human antibodies from 
phage display libraries"^ in: Protein Engineering of Antibody Molecules for Prophylactic 
and Therapeutic Applications in Man, Clark, M. (Ed.), Nottingham Academic, pp 45-64 
(1093); Burton and Barbas, "Human Antibodies from combinatorial libraries" Id., pp 65- 
10 82). Fully human monoclonal antibqdies may also be produced using transgenic mice 
engineered to contain human immunoglobulin gene loci as described in PCT Patent 
Application W098/24893, Jakobovits et al., published December 3, 1997 (see also, 

i i 

! * 

Jakobovits, 1998 Exp. Opiru Invest Drugs 7: 607-614). This method avoids the in vitro 
manipulation required with phage display technology and efficiently produces high affinity, 
1 5 authentic human antibodies. 



The antibody or fragment thereof of the invention may be labeled with a detectable 
marker or conjugated to a second molecule, such as a therapeutic agent (e.g., a cytotoxic 

* * 

agent) thereby resulting in an immunoconjugate. For example, the therapeutic agent 
20 includes, but is not limited to, an anti-tumor drug, a toxin, a radioactive agent, a cytokine, 
a second antibody or an enzyme. Further, the invention provides an embodiment wherein 
the antibody of the invention is linked to an enzyme that converts a prodrug into a 
cytotoxic drug. 

25 Examples of cytotoxic agents include, but are not limited to ricin, ricin A-chain, 
doxorubicin, daunorubicin, taxol, ethiduim bromide, mitomycin, etoposide, tenoposide, 
vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin D, diphteria 
toxin, Pseudomonas exotoxin (PE) A, PE40, abrin, arbrin A chain, modeccin A chain, 
alpha-sarcin, gelonin, mitogellin, retstrictocin, phenomycin, enomycin, curicin, crotin, 

30 calicheamicin, sapaonaria officinalis inhibitor, and glucocorticoid and other 
chemotherapeutic agents, as well as radioisotopes such as 212 Bi, 13, 1, 13 l ln 9 90 Y, and 186 Re. 
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Suitable detectable markers ! for diagnostic used include, but are not limited to, a 
radioisotope, a fluorescbnt compound, a bioluminescent compound, chemiluminescent 

compound, a metal chelator or an enzyme: . Antibodies may also be conjugated to an anti- 

i • 

5 cancer pro-drug activating enzyme capable of converting the pro-drug : to its active form. 

See, for example, U.S. Patent Nos. 4,952,3 94 and 5,7 16,990. ' 

1 if . - * i 

i 1 

Additionally, i a recombinant protein of the invention comprising the antigen-binding 
region of any of the 1 monoclonal antibodies of the invention can be made. In such a 

■ i i ' * 

I 

, i 

10 situation, the antigen-binding region of the recombinant protein is joined to at least a 
functionally active portion of a second protein having therapeutic activity. The second 
protein can include, but is not limited to, an enzyme, lymphokine, oncostatin or toxin. 
Suitable toxins include those described above. 

15 Techniques for conjugating or joining therapeutic agents to antibodies are well knpwn 
(Anion et al., "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer Therapy", 
in: Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. (eds.), pp. 243-56, Alan R. 
Liss, Inc. 1985; Hellstrom et al., "Antibodies For Drug Delivery", in: Controlled Drug 
Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53, Marcel Dekker, Inc. 1987; Thorpe, 

20 "Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A Review", in: Monoclonal 
Antibodies '84: Biological And Clinical Applications, Pinchera et al. (eds.), pp. 475-506 
(1985); and Thorpe et al., "The Preparation And Cytotoxic Properties Of Antibody-Toxin 
Conjugates", in: Immunol Rev., 62:119-58 (1982)). Techniques for joining detectable 
markers to antibodies are also known. 

25 

D) PHARMACEUTICAL COMPOSITIONS OF THE INVENTION 

The invention includes pharmaceutical compositions for use in the treatment of microbial 
infections comprising a pharmaceutically effective amount of an anti-CEG antibody or a 
30 CEG polypeptide. 
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In one embodiment, the pharmaceutical compositions may comprise a CEG antibody, 

either unmodified, conjugated to a therapeutic agent (e.g., drug, toxin, enzyme or second 

1 t ■ , ' . > .r , .... 

antibody) or in a recombinant form (e.g., chimeric or ^specific). The compositions may 
additionally include other antibodies or conjugates (e.g., an antibody cocktail). 
5 * 

The pharmaceutical compositions also preferably include suitable carriers and adjuvants 

- which include any material which when combined with the molecule of the invention 

1 , 1 i •, ■ 

(e.g., &n anti-CEG antibody or a CEG protein) retains the molecule's activity and is non- 

1 r 

reactive with the subject's immune systems, , Examples of suitable carriers and adjuvants 

■ 

10 . include, but are not limited to, human serum albumin, ion exchangers, alumina, lecithin, 
buffer substances such as phosphates, glycine, sorbic acid, potassium sorbate, and salts or 
electrolytes such as protamine sulfate. Other examples include any of the standard 
pharmaceutical carriers such as a phosphate buffered saline solution, water, emulsions 

( i " i 

such as oil/water emulsion, and various types of wetting agents; Other carriers may also 
15 include sterile solutions, tablets including coated tablets and .capsules. Typically such 
carriers contain excipients such as starch, milk, sugar, certain types of clay, gelatin, 
stearic acid or salts thereof, magnesium or calcium stearate, talc, vegetable fats or oils, 
gums, glycols, or other known excipients. Such carriers may also include; flavor and 
color additives or other ingredients. Compositions comprising such carriers are 
20 formulated by well known conventional methods. Such compositions may also be 
formulated within various lipid compositions, such as, for example, liposomes as well as 
in various polymeric compositions, such as polymer microspheres. 

* 

The pharmaceutical compositions of the invention can be administered using 
25 conventional modes of administration including, but not limited to, intravenous, 
intraperitoneal, oral, ijitralymphatic or administration directly into the tumor. 
Intravenous administration is preferred. 

The pharmaceutical compositions of the invention may be in a variety of dosage forms 
30 which include, but are not limited to, liquid solutions or suspensions, tablets, pills, 
powders, suppositories, polymeric microcapsules or microvesicles, liposomes, and 
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injectable or infusible solutions. . The preferred form depends upon the mode of 
administration and the therapeutic application. 

i 

The CEG polypeptides and proteins of this invention are found in common pathogenic 
5 bacterial species such as Streptococcus pneumoniae. This organism causes upper 
respiratory tract infections. Thus, the peptides and proteins of this invention can be used 

* * * 

as immunogens in subunit vaccines for vaccination against a pathogenic bacteria such as 

, . . ' f . . '• , i ' ' ■ • , 

Streptococcus pneumoniae. Additionally, the ceg sequences o'f the invention can be used 

as'DNA vaccines (U.S. Patent No. 5,736,524 and U.S. Patent No. 5,989,553). 

The polypeptides and proteins of this invention can be formulated , as univalent and 
multivalent vaccines. The protein can be mixed, conjugated or fused with other antigens, 

including B or T cell epitopes of other iantigens. * 

i 

15 Further, when a haptenic peptide of the proteins of the invention is used, (i.e., a peptide 
which reacts with cognate antibodies, but cannot itself elicit an immune response), it can 
be conjugated to an immunogenic carrier molecule. Conjugation to an immunogenic 
carrier can render the oligopeptide immunogenic. Examples of carrier molecules are 
tetanus toxin or toxoid, diphtheria toxin or toxoid and any mutant forms of these proteins 

20 such as CRM.suKl97. Others include exotoxin A of Pseudornonas, the heat labile toxin 
of E. colt and rotaviral particles (including rotavirus and VP6 particles). Alternatively, a 
fragment or epitope of the carrier protein or other immunogenic protein can be used. For 
example, the happen can be coupled to a T cell epitope of a bacterial toxin. 

25 In formulating the vaccine compositions with the CEG polypeptides or proteins of the 
invention, alone or in the various combinations described, the immunogen is adjusted to 
an appropriate concentration and formulated with any suitable vaccine adjuvant. Suitable 
adjuvants include, but are not limited to: surface active substances, e.g., hexadecylamine, 
octadecylamine, octadecyl amino * acid esters, lysolecithin, * dimethyl- 

30 dioctadecylammonium bromide), methoxyhexadecylgylcerol, and pluronic polyols; 
polyamines, e.g., pyran, dextransulfate, poly. IC, carbopol; peptides, e.g., muramyl 

55 



WO 01/49721 PCT/US00/35604 



10 



15 



25 



dipeptide, dimethylglycine, tuftsin; oil emulsions; and mineral gels, e.g., aluminum 

hydroxide, aluminum phosphate, etc. and immune stimulating complexes. The 
immunogen may also be incorporated into liposomes, or conjugated to polysaccharides 
and/or other polymers. ( ' > ■ ■ *' 



The vaccines can be administered to , a human or animal in a variety of ways. These 

■ ■ * 

include intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, pral and 
intranasal routes of administration. Further, the vaccines can be live or inactivated 



vaccines. 



i r 



The most effective mode of administration and dosage regimen for the compositions of 
this invention depends upon the severity and course of the disease, the patient's health 

and response to treatment and the judgment of the treating physician. Accordingly, the 

i 

dosages of the compositions should be; titrated to the individual patient. 



E) USES OF THE MOLECULES OF THE INVENTION 

i * 

1) MOLECULAR WEIGHT MARKERS 

20 The nucleic acid molecules of the invention and their encoded proteins may be employed 
as molecular weight markers. For example, the molecular weight of each of the nucleic 
acid molecules having ceg sequences and their predicted polypeptides can be determined 
and can be used to compare against other gene sequences and proteins whose molecular 
weights are unknown. 



2) DIAGNOSTICS 



The nucleic acid molecules of the invention may be employed in diagnostic 
embodiments. For example, the presence of nucleotide sequences which are identical or 
30 similar to the ceg sequences of the invention may be detected within a biological sample. 



56 



NSDOCID: <WO_01 49721 A2 I > 



WO 01/49721 PGT/US00/35604 

The biological sample' may include blood, serum or a swab from nose, ear or throat, may 

be determined by means of a nucleic acid detection assay, 

■ i * ■ 

I ' I ' , ' 

' I 

Nucleic acid probes or primers having sequences complementary to ceg sequences may 

* I 

5 be used in a hybridization assay to detect the presence of the sequences which are 
identical or similar to the ceg sequences of the invention in the biological samples. 
Typically, nucleic acids molecules obtained frbm a suitable biological sample are 

i * , + 

hybridized with labeled probes or primers. , The resulting hybridized molecules are 

detected and resolved by methods well known in the' art , such as Northern or Southern 

. .i ' 

10 blotting, micro-array technology, or amplifying with PGR technology. Other 
hybridization techniques arid systems are known that can be used in connection with the 
detection aspects of the invention, including diagnostic assays such as those described in 
Falkow et al., U.S. Pat. No. 4,358,535. 

i 

1 1 

15 .Examples of the PCR technology are disclosed in U.S. Patent Nos. 4,683,202 and 
4,965,188 (incorporated herein by reference). Generally, nucleic acid molecules are 
obtained from a suitable biological source and contacted with two primers corresponding 
to the ceg sequences disclosed herein, under conditions which allow for hybridization and 
polymerization to occur. A pair of probes, one corresponding to the 5 ! flanking region 

20 and the other corresponding to the 3' flanking region, would be sufficient to detect the 
nucleic acid molecules of the invention in a biological sample and may be used to 
indicate the amount of bacteria present. 

Alternative methods of detecting nucleic acid molecules include, for example, in situ ' 
25 hybridization techniques, where a ceg probe is used to detect homologous sequences 
within one or more cells, such as cells within a clinical sample or even cells grown in 
tissue culture. As is well known in the art, the cells are prepared for hybridization by 
fixation, e.g. chemical fixation, and placed in conditions that allow for the hybridization 
of a detectable probe with nucleic acids located within the fixed cell. 
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The amount of ceg sequences present in a biological sample , can be quantified and 
compared to the levels in a normal or "healthy" sample. For example, ceg sequences 
present in either increased or decreased levels, compared to the levels foynd in the 
control sample may indicate the presence of bacteria. This, information is useful for 
5 ' diagnosis of a bacterial infection that requires treatment with an antibacterial agent. 

' , ,■ ■ 

Alternatively, the amount of CEG polypeptides present in a biological sample .may be 

1 i j 1 * 1 ( 

determined by means of an immunoassay. For example, labeled antibodies reactive 

against CEG polypeptides may be used in an immuno-reactive assay to detect the 

10 presence of CEG polypeptides in the biological samples. 

I 

3) SCREENING CANDIDATE CEG SEQUENCES 

a) Gene Disruption Assay , 

15 

The ceg nucleotide sequences of the invention can be used to identify nucleotide 
sequences which are identical or similar to the ceg sequences that are required for 
bacterial cell viability. For example, the ceg sequences can be used in a bacterial gene 
disruption assay to screen candidate nucleotide sequences to identify sequences required 
20 for bacterial cell viability. 

The disruption assay can involve: introducing into a host cell a recombinant vector that is 
capable of integration into the host genome, where the recombinant vector, includes a 
candidate sequence that putatively encodes a cell-viability gene product (e.g., the 
25 exogenous ceg sequence); the vector integrates the candidate sequence into a target 
sequence within the host's genome (e.g., the endogenous ceg sequence); and the host cell, 
so introduced, is screened for viability. The recombinant vector preferably includes a 
selectable marker so that the introduced host cell can be screened for viability in the 
presence of a selectable agent. 

30 
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For example, Figure 1 shows a schematic representation of a gene disruption assay, 
within a bacterial host cell. In Figure 1 A, the recombinant vector,, pEVP3, includes the 

, 'I i . s i * 1 1 i • r ,' i ; . i ' • 

CAT gene (e.g., the selectable marker chloramphenicol acetyl transferase) and an internal 
region of the ceg disrupting sequence; the internal region excludes the 5* and 3* ends of 
5 the ceg sequence. The "X" in Figure 1 indicates the recombinant pEVP3 vector undergoing 
homologous recombination with the target sequence * (e.g., within the host genome). In 

| . • 

Figure IB, the resolved pEVP3 vector that is integrated into the host genome, is shown. 
Left to right are the following elements: the native promoter of the target gene; a 5' partial 
copy of the target gene; the body of the integrated pEVP3 vector including the disrupting 

t 

10 gene and CAT; and, a 3' partial copy of the target gene. Thus, integration of the pEVP3 
vector via homologous recombination results in two partial gene duplications flanking the 
integrated vector. If the target gene is not essential for survival, it is possible to recover 
chloramphenicol-resistant colonies of S. pneumoniae. Failure to recover chloramphenicol 
resistant colonies, in the presence of the proper controls as described below, indicates that 

15 the target gene may be essential for cell viability. 

More particularly, the gene disruption assay for screening candidate ceg sequences can 
involve the following steps. The recombinant pEVP-3. vector encoding CAT resistance 
and having a fragment of a candidate ceg sequence, can be introduced into 

20 transformation-competent S. pneumoniae cells by methods that are well-known in the art 
(Lee, M.S., et al., 1998 Appl Environ. Microbiol 64:4796-4802). The preferred size of 
the ceg fragment can be between about 200 to about 500 bp in length. It is advantageous 
that the candidate ceg sequence does not include the 5' and V ends that encode the N- 
and C-terminal ends of the CEG polypeptide. This insures that the inserted ceg fragment 

25 and the disrupted endogenous ceg gene sequence are not capable of expression of a full- 
length, functional ceg gene product. The transformation-competent cells can be obtained 
by performing the transformation step in the presence of a heptadecapeptide that induces 
competence for transformation of S. pneumoniae (Havarstein, L. S., et aL, 1995 Proc. 
Natl Acad Sci. 92:11140-11144), such as the CSP-1 peptide. The CSP-1 can be 

30 naturally-derived or synthetic. Additionally, the transformation step can.be optimized by 
performing the transformation when the cells have reached a density which is optimal for 
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transformation (e.g: r 3 X 10 cells per ml.) (Havarstein, L. S. et al. supra). The 
recombinant vector can be introduced into the competent pneumococci and may undergo 

homologous recombination, whereby the candidate ceg fragment recombines with the 

• 1 1 i 

corresponding endogenous ceg sequence,' resulting in targeted ' integration' of the vector 
5 into the pneumococcal genome and disruption of the endogenous ceg. 

1 i i i * ' ■ • 

ii , . • ■ i 

The transformed cells can be plated on or cultured in chloramphenicol^containing growth 

medium. The cells can be cultured under standard conditions, such as 37° C in 5% CO2 

for approximately 46 to 48 hours, for the purpose of selecting cells that carry the 
• ■ 1 
10 integrated vector, 

! 

Additionally,, control samples can be run in parallel with the gene disruption assay, in 
order to determine whether the gene disruption procedure is working properly. For 

t \ ' " 1 

1 

example, the control .samples can be used to calibrate the gene disruption experiment so 
15 that disruption of a known non-essential bacterial gene results in an approximate number 
of colonies per plate. Similarly, the disruption of a known essential gene can be 
calibrated to yield only zero or one colony per plate. The appearance of one colony is 
due to the rare illegitimate recombination into a non-homologous sequence. In particular, 
a known non-essential gene such as the lytA gene (Tomasz, A., et al. s 1988 J. Bqcteriol 
20 170:5931-5934) can be used so that between about 70 to 100 chloramphenicol-resistant 
colonies will grow per plate. Similarly, the ftsZ gene (Lutkenhaus, J. F., et al., 1980 J. 
Bacteriol 143:1281-1288), a known essential gene, can be used to yield zero or, rarely, 
one colony per plate. As is well known in the art, specific parameters that are involved in 
any given gene disruption assay can be adjusted to calibrate the desired number of plated 
25 cells in the control samples. Experimental parameters that can be adjusted include, but 
are not limited to, the E. coli strain used to propagate the vector/insert, the fragment 
length of the sequence to be integrated, the amount of recombinant integration vector 
used to transform the cells, use of transformation-competent cells, and plating density of 
the transformed cells. 

30 
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The transformed cells carrying tlie recombinant integration vector that disrupts 
expression of an endogenous essential gene (e.g., the target ceg gene) can be identified, 

i based on a selectable phenotype such as non- viability 1 . For example, the cells that catry a 

i 

disrupted non-essential gene will be viable and, due to the integration of pEVP3, will 
5 grow on chloramphenicol-containing medium. In contrast, cells that carry a disrupted 
essential gene will not grow (e.g., non-viable) on the chloramphenicol-containing 
medium. Thus, the transformed cells that do not grow under these selective conditions 
carry an endogenous gene sequence j that is essential for cell viability which has been' 
disrupted by an exogenous candidate fragment, thereby identifying a ceg sequence. Steps 
10 one through three may be repeated [ in order to confirm that the ceg sequences, so 
identified, are essential for cell viability, 

b) Autolysin Assay 

i i 

i 

15 It is advantageous to perform additional steps to determine whether the homologous 
recombination events result in disruption of the intended target gene sequence. The lytA 
transformation control can be used to confirm that the transformation system is 
; functioning properly. For example, a phenotypic test for autolysin activity (lytA gene 
product) can be performed to determine that the exogenous lytA fragment is correctly 

20 integrated into the lytA site within the host genome. This typically involves flooding the 
culture plates containing transformants carrying the integrated lytA control vector with a 
solution of detergent, such as 0.1% deoxycholate, which triggers cell lysis in lytA -intact 
cells (e.g., the cells that have not undergone homologous recombination). After about 5- 
10 minutes the colonies with intact lytA will appear ghost-like due to cell lysis, and the 

25 colonies with a disrupted lytA gene will appear intact. 

c) Polarity Analysis 

The ceg sequences that are confirmed to be essential for cell viability can be examined 
30 further by performing a polarity analysis to determine if the corresponding endogenous 
ceg sequence is organized in an operon. Polarity is an effect unique to prokaryotes and is 
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the result of the operon organization of bacterial genomes. Many bacterial genes are 

arranged in operons in which multiple genes are under the control of a single regulatory 
1 sequence (e.g., a promoter) and are transcribed into a single, mRNA transcript. With 
respect to the orientation of multiple genes within an operon, the genes that are proximal 
5 to the regulatory sequence are said to be "upstream" genes and the genes that are distal 
are said to be "downstream" genes. For example, many operons contain genes encoding 
different proteins that catalyze discrete steps of a common biochemical pathway., Thus, 

any of the proteins that catalyze the steps of the pathway may be essential for cell 

* ' 1 1 1 

viability. j 

10 I 

The presence of operons in a bacterial host genome may influence; the interpretations of 
the gene disruption results. For exiample, disruption of an upstream gene may be 

i ! 

erroneously interpreted as affecting the expression of the disrupted gene but may, in fact, 
have expression affects on the intact downstream genes. Therefore, it is advantageous to 

i 

15 perform a polarity analysis to determine if a ceg sequence is part of an operon. 

A polarity analysis can involve performing an in vivo gene disruption procedure using* as. 
the disrupting sequence, a ceg sequence that includes the entire ceg coding sequence- 
region but lacking expression regulatory sequences. This differs from the gene disruption 
20 assay, which involves the central region of the ceg sequence. The polarity analysis 
involves gene duplication via homologous recombination. For example, the pEVP-3 
vector haying the entire coding region of a ceg sequence can be used for the polarity 
analysis (Figure 2 A). The polarity analysis will yield different results depending on the 
organization of the endogenous target sequence within the host genome. 

25. 

For example, Figure 2 shows a schematic representation of the polarity test for operons, 
within a bacterial host cell. In Figure 2A, the recombinant vector, pEVP3, includes the 
CAT gene and the entire coding region of the ceg disrupting sequence. The "X" in Figure 
. 2 indicates the recombinant pEVP3 vector undergoing homologous recombination with the 
30 target sequence. Two of the possible results of homologous recombination are shown in 
Figures 2 B and C. 
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i . < ' ' 

In Figure 2 B, case 1, if the endogenous target sequence is not organized in an operon, the 
integration event may ' yield: ' a functional ' target sequence (es.g., it is capable of 
expression); a duplicate non-functional target sequence that lacks ia promoter; and a 
5 functional downstream gene (e.g., Gene B) that is controlled by its own promoter. The 
cells carrying this type of integrated target sequence can be recovered as viable cells that 

grow in the presence of chloramphenicol; this condition is termed "polarity negative". 

\ - i ' , - ' " ; 

t t 1 i 

t i 

In Figure 2 C, case 2, if the target sequence is organized in an operon, then the integration 
10 event may yield an integration site that is similar to that described for case 1, including: a 
functional target sequence; and a duplicate non-functional target sequence which is not 
functional. However, this integration event may also yield a non-functional downstream 
gene (e.g., Gene B) because expression of this downstream gene is controlled by a 
promoter located upstream of the insertibri site. The cells that carry this type of 
1 5 integrated target sequence will be non- viable; this condition is termed "polarity positive". 
Thus, the polarity analysis provides a method to determine whether integration of a 
. recombinant vector into a target ceg sequence effects expression of downstream genes. 

* * 

The ceg sequences disclosed herein (SEQ ID NOs.: 1-113, 227-331) encode gene 
20 products that are essential for viability in S. pneumoniae. Furthermore, many of these 
ceg sequences have been analyzed for the polarity effect and the results are presented in ' 
Table I. One subset of ceg sequences is classified as polarity negative (-), since the 
homologous recombination event did not effect the expression of downstream genes. 
Another subset of ceg sequences is classified as polarity positive (+), since the 

m 

25 homologous recombination event did affect the expression of downstream genes. The 
ceg sequences that have not yet been classified as polarity positive or negative are 
indicated in Table I as a blank. For the ceg sequences that are classified as polarity 
positive, the genes downstream of the disrupted endogenous ceg sequences may or may 
riot also be essential. 

30 
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4) ASSAYS FOR IDENTIFYING CEG LIGANDS AND OTHER 
BINDING AGENTS 

1 t ' " 

The present invention' provides screening methods for identifying agents that interact 
5 and/or bind to the CEG proteins of the invention, such as a ligand. An agent can be, for 
. example, a natural product, a derived or synthetic chemical molecule, a polypeptide, a 
nucleic acid molecule, or a metal. The agents that interact with CEG proteins may cause 
bacterial cell death by disrupting the , functions of CEG proteins, including, but not 
limited to, ( nucleotide biosynthesis, DNA replication, RNA transcription, protein 
10 translation, and/or cell wall biosynthesis. Accordingly, the present invention provides 
screening methods for identifying agents having antibacterial activity, such as agents that 

i « ■ 

cause bacterial cell death by interacting 1 with the CEG proteins. These antibacterial . 
agents are useful for treating diseases and afflictions associated with bacterial infections. 

15 Various methods can be used to discover agents having antibacterial activity, as 
determined by the ability of the binding agent to bind to a CEG protein and disrupt the 

_ 1 

function of the CEG protein. These screening methods include whole cell in vivo assays 
as well as in vitro assays with cellular components. 

20 An in vivo screening method for identifying ligands that bind CEG polypeptides can be 
performed in a whole cell assay. A, typical method may be the use of whole bacterial 
cells to assess the antibacterial properties based on cell growth or viability. These 
methods can include methods for measuring cell growth and/or viability, for example, by 
optical density or zones of growth (Koch, A. L. et al., 1970 Anal Biochem. 38:252-259; 

25 Biemer, J. J. et al., 1973 Ann, Clin. Lab. Set 2:135-140; Manual of Clinical 
Microbiology; 7 th edition, Murray, P. R. (ed), ASM Press), by growth inhibition in an 
agar assay (Murray, P. R., supra), or other means of detecting cell metabolism 
(Mychajlonka, M. et al., 1980 Antimicrob. Agents Chemother. 17:572-582), and are well 
known to those skilled in the art. In addition, there are molecular biology-based detection 

30 methods for use with whole bacterial cells, such as gene reporter assays, to monitor the 
effect* of the ligand on specific targets (Slauch, J. M., et al., 1991 Methods Enzymol 
204:213-248). Examples of the reporter genes include, but are riot limited to, beta- 
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galactosidase, alkaline phosphatase, luciferase, and green fluorescent protein. For 
example, one embodiment provides a reporter system that monitors inhibition of DNA 
1 synthesis by fusing a reporter such as beta-galactosidase (lacZ) to genes known t6 be 
upregulated by the cessation of DNA synthesis as a result of the binding of ligands to the 
5 DNA synthetic apparatus. (Shurvinton, C. E., et al., 1982 Mol Gen Genetics 185:352- 
355; Rosato, A., et al., 1998 Antimicrqb. Agents Chemother. 42:1392-1396)w 

Alternatively, the yeast two-hybrid system (Fields, S. and Song, O. 1989, Nature 

| 

340:245-246) may be adapted to screen for ligands that bind CEG polypeptides. Generally, 
10 the yeast two-hybrid system is performed in a yeast host cell carrying a reporter gene, and 
is based on the modular nature of the GAL transcription factor which has a DNA binding 
domain and a transcriptional activation domain. The yeast two-hybrid system relies on 

i 

the physical interaction between a recombinant polypeptide that comprises the GAL 
DNA binding domain and another recombinant polypeptide that comprises the GAL 

i 

15 transcriptional activation domain. The physical interaction between the two recombinant 
polypeptides reconstitutes the transcriptional activity of the transcription factor, thereby 
causing expression of the reporter gene. Either of the recombinant polypeptides used in 
the two-hybrid system can be generated to include a CEG polypeptide sequence to screen 
for binding partners of CEG. 

20 

Another method uses the bacterial CEG proteins as the basis for in vitro assay systems to 
detect binding agents. Typically, the in vitro screening method comprises: a) generating 
the CEG protein of the invention, or membranes enriched in the CEG protein; b) 
exposing the CEG protein or membranes to a candidate agent; and c) detecting the 
25 interaction of the CEG protein with the agent by any suitable means. Additionally, the 
screening methods may be adapted to automated high-throughput procedures, such as 
PANDEX.RTM Baxter-Dade Diagnostics, allowing for efficient high-volume screening 
of candidate agents. 

30 An alternative method for screening potential, ligands involves an in vitro binding 
procedure. Typically, the CEG proteins of the invention can be produced using 
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recombinant DNA technology and host- vector systems as described herein. ' A candidate 
agent is introduced into a reaction vessel containing the CEG protein, of fragment 
thereof; the candidate agents may be detectable by methods such 1 as, but not limited to, 
radioisotope or chemical labeling. Binding of the CEG protein by a candidate agent can 
5 be determined by any suitable means, including, for example, quantifying bound label 
versus unbound label using any suitable method. Binding of a candidate agent may also 
be detected by methods similar to an alternative physical method disclosed in U.S. Patent 
No. 51,585,277.' In this method, binding of a candidate agent to a protein is assessed by 
monitoring the ratio of folded protein to unfolded protein, for example by monitoring 
10 sensitivity of the protein to a protease, or amenability to binding of the protein by a 
specific antibody against the folded state of the protein, or binding to chaperone protein, 
or by binding to any suitable surface. ' ■ 

The invention provides methods of identifying compounds that modulate (e.g., activate or 
15 inhibit) the function of a CEG polypeptide. Essentially any compound can be used in, the 
assays of the invention. The preferred compounds are those that are soluble in aqueous 
or organic solutions. It will be appreciated by those of skill in the art that there are many 
commercial suppliers of chemical compounds that can be used in the methods of the 
invention, including Sigma Chemical Co. (St. Louis, Mo.), Aldrich Chemical Co. (St. 

* 

20 Louis, Mo.), Sigma-Aldrich (St. Louis, Mo.), Fluka Chemika-Biochemica Analytika 
(Buchs, Switzerland), and the like. 

The present invention provides methods for detecting compounds which are identified as 
modulators of CEG function. The methods of the invention can be performed using 

25 isolated CEG polypeptides, or use whole cells expressing the CEG polypeptide. The 
steps . of the method using isolated CEG polypeptides include: contacting the isolated 
CEG polypeptide with a candidate compound; and determining whether the function of 
the CEG polypeptide is altered. The steps of the method using whole cells include: 
contacting the whole cells with a candidate compound; and determining whether the cell 

30 dies, indicating the compound inhibited the function of a CEG polypeptide. 
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The preferred methods of the invention provide high-throughput screening assays for 
identifying compounds which modulate the function of a CEG polypeptide. The high 

throughput methods permit screening of large libraries of compounds. For example! the 

• , i 

high throughput methods can use automated assay steps. ' The assays can be performed 
in parallel on a solid support, as microtiter formats on microtiter plates in robotic assays 
are well known. A preferred embodiment of the methods includes adapting the methods 
to use microtiter plates or pico- nano- or micro-liter arrays. In high throughput assays it 
is desirable to run positive controls to, ensure that the components 1 of the assays are 
working properly. 

' i ..• i 

! 

j i 
' * ■ 

The high throughput screening methods of the invention include .providing a 
combinatorial library containing a large number of compounds (candidate modulator 
compounds) (Borman, S, C. & E. News, 1999, 70(10), 33-48). Such combinatorial 
chemical libraries can be screened in one or more assays to identify library members 

i 

i 

(particular chemical species or subclasses) that exhibit the ability to modulate the 
function of the CEG polypeptide (Borman, S., supra; Dagani, R. C.&E. News, 1999, 
70(10), 51-60). The compounds, so identified, can serve as lead-compounds or can 
themselves be used as potential or actual therapeutics. 

* 

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by using either chemical synthesis or biological synthesis, to combine a 

■ 

number of chemical building blocks, such as reagents. For example, a linear 
combinatorial chemical library, such as a polypeptide library, is formed by combining a * 
set of chemical building blocks (amino acids) in every possible way for a given 
compound length (i.e., the number of amino acids in a polypeptide compound). Millions 
of chemical compounds can be synthesized through such combinatorial mixing of 
chemical building blocks. 

Preparation and screening of combinatorial chemical libraries is well known to those of skill in 
the art. Such combinatorial chemical libraries include, but are not limited to, peptide libraries 
(see, e.g., U.S. Pat. No. 5,010,175, Furka, Int. J. Pept. Prot. Res., 1991, 37:487-493 and 
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Houghton, et al., Nature, 1991, 354, 84-88). Other chemistries for generating chemical 
diversity libraries can also be used. Such chemistries include, but are not limited to, peptoids 
.' (PCT Publication No. WO 9i/19735); encoded peptides (PCT Publication. WO 93/10242); 
random bio-oligomers (PCT Publication No: WO 92/00091); benzodiazepines (U.S. Pat No. 
' 5 5,288,514); diversomers, such as hydantoins, benzodiazepines and dipeptides (Hobbs, et al., 
Proc. Nat. Acad. Sci. USA, 1993, 90, 6909-6913); vinylogous polypeptides (Hagihara, et al., J. 
Amer. Chem. Sod. 1992, 114, 6568); nonpeptidal peptidomimetics with Z>eta-D-glucose 
scaffolding (Hirsclimann, et al., J. Amer. Chem. Soc, 1992, 114, 9217-9218); analogous 

, • * i 

organic syntheses of small compound libraries (Chen, et al., J. Amer. Chem. Soc, 1994, 116, 
10 2661; Armstrong, et al. Acc. Chem. Res., 1996, 29, 123-131); or small organic molecule 
libraries (see, e.g., benzodiazepines, Baum C&E News, 1993, Jan. 18, page 33,); 
oligocarbamates (Cho, et al., Science, 1993, 261, 1303); and/or peptidyl phosphonates 

t t 

+ « r p 

(Campbell, et al., J. Org. Chem. 1994, 59, 658); nucleic acid libraries (see, Seliger, H et al., 
Nucleosides & Nucleotides, 1997, 16, 703-710); peptide nucleic acid libraries (see, e.g., U.S. 

i 

15 Pat. No. 5,539,083); antibody libraries (see, e.g., Vaughn, et al., Nature Biotechnology, 1996, 
14(3), 309-314 and PCT/US96/10287); carbohydrate libraries (see, e.g., Liang, et al., Science, 
1996, 274, 1520-1522 and U.S. Pat No. 5,593,853, Nilsson, UJ, et al., Combinatorial 
Chemistry & High Throughput Screening, 1999 2, 335-352; Schweizer, F; Hindsgaul, O. 
Current Opinion In Chemical Biology, 1999 3, 291-298); isoprenoids (U.S. Pat. No. 

20 5,569,588); thiazolidinones and metathiazanones (U.S. Pat. No. 5,549,974); pyrrolidines (U.S. 
Pat. Nos. 5,525,735 and 5,519,134); morpholino compounds (U.S. Pat No. 5,506,337); 
benzodiazepines (U.S. Pat. No. 5,288,514); and other similar art. 

Devices for the preparation of combinatorial libraries are commercially available (see, 
25 e.g., 357 MPS, 390 MPS, Advanced Chem. Tech, Louisville Ky., Symphony, Rainin, 
Woburn, Mass., 433 A Applied Biosystems, Foster City, Calif, 9050 Plus, Millipore, 
Bedford, Mass.). In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, 
Tripos, Inc., St. Louis, Mo., ChemStar, Ltd., Moscow, RU, 3D Pharmaceuticals, Exton, 
30 Pa., Martek Bio sciences, Columbia, Md., etc.). 

* 
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In the high throughput methods of the invention, several thousand different candidate 

i , ■ j 

compounds can be screened in a relatively short period of time. For example, each well 
of a microtiter plate 'can be used to run a separate assay against a selected potential 
modulator, or if concentration or incubation time effects are to be observed, every 5-10 
wells can test a single modulator. Thus, a single standard microtiter plate can assay about 
100 (96) modulators. If 1536 well plates are used, then a single plate can easily assay 
from about 100 to about 1500 different compounds. It is possible to assay many different 
plates, per day; assay screens for up to about 6,000-20,000, and even up to about 100,000- 
1,000,000 different candidate modulator compounds ' are possible using the methods of 
the invention. 



15 



25 



The following examples are presented to illustrate tble present invention and to assist one of 

■ ! ■ , , ■ "I 

ordinary skill in making and using the same. The examples are not intended in any way to 
otherwise limit the scope of the invention. 



EXAMPLE 1 



The following provides a general description of how a list of candidate ceg sequences 
was generated. The list was generated by selecting candidate ceg gene sequences from a 
20 Concordance web engine using the method described in: Bruccoleri, R.E., Dougherty, 
T.J., Davison, D.B. (1998) "Concordance analysis of microbial genomes" in: Nucleic 
Acids Res 26:4482-4486. 



Microbial Genomics CEG Discovery Process Summary. 



Microbial Concordance Analysis 



30 



The entire genomic sequence data of various bacteria was acquired from several public 
and proprietary sequence database sources, including GTC (Genome Therapeutics 
Corporation), and TIGR (The Institute for Genomic Research). 
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Predicted ORFs from the genomic data were identified, translated, ; and stored. % The 
desirable ORFs were at least 90 amino acid residues in length. Concordance analysis 

was performed among bacteria arid various parameters were used to filter out genes with 

1,1 i 1 'i 1 

high similarity to eukaryotes. 

Concordance Analysis i • . , 



The entire genomic sequence of various Eubacteria was . acquired from several public and 
private sources. The proprietary PathoGenome System from Genome Therapeutics 
10 Corporation, Waltham, MA, USA contributed data. Public data was obtained from 
GenBank Oittp://ncbi.nlm.nih.gov), The Institute for Genomic Research (TIGR), the 
Yeast Proteome Database, from Proteome, Inc. of Beverly, MA,, and the Sanger Center of 
the Medical Research Council of the United Kingdom (http://www.sanger.ac.uk). 
Additionally, the non-microbial sequence data used as a basis for comparison and data 

i • 

15 subtraction was obtained from a proprietary database, including the LifeSeq Database 
from Incyte Pharmaceuticals, Palp Alto, CA. 

Where required, Incyte nucleotide sequences were translated into protein sequences in all 

* 

six possible reading frames. GTC supplied predicted protein sequences with their data. In 
20 the case of other eubacterial nucleotide sequences, the program CRITICA (Badger, J. and 

■ * « 

Olsen, G., 1999 "CRITICA: coding region identification tool invoking comparative 
analysis" in: Molecular Biology and Evolution 16:512-524). The sequences were stored 
in flat files on a Unix computer system. Each predicted amino acid sequence had to be 
greater than 90 amino acids. 

25 

Each predicted protein sequence was compared to every other sequence (an "all-agairist- 
all" comparison). The program used was FASTA (Pearson, W.R., "Flexible sequence 
similarity searching with the FASTA3 program package." Methods in Molecular Biology 
2000 132:185-219.) The parameters used were ktup-2, and all scores above the default 
30 cutoff were kept. The output was processed and stored in a PostGres 95 database 
(http://www.postgresql.org) . Graphical user interfaces, using web browser technology, 
were constructed to query the database. 
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A Concordance Analysis was performed on the data. The question used to generate the 
dataset was show all Streptococcus pneumoniae open reading frames with a similarity 
5 greater than or equal to 30% overall protein sequence identity to both selected gram- 
positive and/or gram-negative bacteria in the database. The data was further required not 
to match yeast or human sequences at greater than' 30% overall protein sequence 
... similarity. The resulting dataset included a list of more than 400 conserved amino acid 
sequences having known or unknown function. The amino acid . sequences having 
1 0 unknown functions formed the basis of a list designated Conserved Unknown Reading 

i 

Frames, or CURFs which is a subset of the total list of CEGs (e.g., CURFs includes 
known and unknown). 

* i 

The resulting list of conserved genes (e.g., more than 400 sequences) was used as a basis 

i 

15 for selecting and screening bacterial gene sequences that are essential for cell viability. 
The Concordance system was designed to permit high-throughput identification of 
conserved gene sequences in the database. (Bruccoleri, R, Dougherty, T, and Davison, D. 
1998 "Concordance analysis of microbial genomes" Nucleic Acids Res. 26:4482-4486.) 

20 Data Curation And Analysis 

Exact N-terminal and C-terminal translational start sites of genes were identified by 
pairwise similarity searches, multiple sequence alignments. Ribosome binding sites, 
terminators, nearby genes, operons were identified. 

25 

The resulting list of conserved genes was used as a basis for selecting and screening 
bacterial gene sequences that are essential for cell viability. This Concordance system 
was designed to permit high throughput use of the conserved gene sequences contained 
on the list. A set of Knockout PCR primers were generated, based on the list of 
30 conserved genes, for the purpose of use in the gene disruption procedure described 
below. The PCR primers were designed to amplify a central 300-500 bp region of the 
ceg (to prevent generation of a functional copy of the ceg gene following integration), 
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ordered electronically, the primers were placed in a 96- well format, and used, in the gene 

disruption procedure as described below. 

i . . 'i 

EXAMPLE 2 ' 1 

J * 

The following provides a description of the procedure to generate recombinant vectors of 
pEVP-3 having inserts of candidate ceg nucleotide sequences. The Knockout, primers 
generated by the method described in Example 1 above were used to generate DNA 
fragments comprising candidate ceg sequences. 
10 i 

Genomic PCR Knockout Target Fragment Generation 

i 1 

96-well plate format were set up (36 jxl H2O , 5 jil 10* Vent™ buffer, 1 nl gene specific, 

i , • 

knockout forward primer (0.5 M-g/fil), 1 (J-l gene specific knockout reverse primer (0.5 
15 \ig/\il), 0.5 p,l Vent™ DNA polymerase (2000 U/ml New England Biolabs, Beverly, 
MA), 1.5 \i\ each dNTPs (lOmM; 6.0 jal total), 0.5 (il S. pneumoniae chromosomal DNA 
(0.5 ng/nl), 50 |il total volume/reaction). 

The nucleotide sequences of the forward and reverse knockout primer pairs were 
20 generated from the nucleotide sequence information obtained from the Genomic 
Therapeutics Corporation database for Streptococcus pneumoniae. The primer pairs were 
each used in a PCR reaction to generate a unique internal (e.g., central region) fragment 

■ 

of the candidate gene targeted for knockout. 

25 The PCR program was set in the PCR machine (Initial 95 °C - 5 minutes: 30 Cycles of: 
95 °C - 1 minute, 58 °C - 1 minute, 72 °C - 30 seconds; Final, 72 °C - 10 minutes, 4 °C - 
hold indefinitely). 5 |nl of each reaction was run on an 0.8% agarose gel after purifying 
fragment over PCR purification kit (Qiagen) to visualize the fragments then ligation 
reactions were performed. 

* 

30 

* 
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i .1 

Ligation Reactions , proceeded (set up in 96-well plate format (10.0 \il genomic PGR 
fragment (generated from step 2 above), 1.0 ^1 pEPV-3 Smal-cut vector (1:10 dilution of 
vector DNA at 50-iod ng/|al), 1.5, \il 10* ligation buffer (New England Biolabs™), 1.0 [il 
T4 DNA Ligase (New England Biolabs^ 400,000 U/ml), 1.5 fil ddH 2 O t 15.0 \il total 

i 

5 reaction volume).* 

;!•.-■•! 1 

Reactions were allowed to incubate in 96-well plate at ,14 °G overnight in the PCR 
machine. Transformations, into E. coli for in vivo amplification were proceeded the 
following day. 1 ' 

10 , 

r 

The nucleotide sequences of the forward and reverse primer pairs used for the polarity 

1 1 
test were generated in a similar manner, from the nucleotide sequence information 

obtained from the Genomic Therapeutics Corporation database for Streptococcus 

■ pneumoniae. The primer pairs were each used in a PCR reaction to generate a unique 

1 5 fragment of the candidate gene targeted for the polarity test. The fragment generated for 

the polarity test included the entire ceg coding sequence region but lacking the expression 

i 

regulatory sequences. 

Transformation into E. coli (strain LE392): 

20 

The next day, 3 \xl of above ligation mix was used per transformation reaction plus 50 \il 
LE392 competent cells. Reactions were set up in 96-well plate format; incubated on ice 
for 30. minutes; heat-shocked at 42° C for 90 seconds; and incubated on ice 2 minutes; 
100 jal SOC media (Gibco BRL) was added; then incubated at 37° C on platform shaker 
25 for 1 hour; plated on LB/chloramphenicol (13.0 \xg/ml) agar plates for constructs over 
night at 37° C with plates inverted and proceeded with colony PCR to confirm constructs. 
The universal primers flanking the insert site in pEVP-3 were used for PCR 
amplification. 

30 The colony PCR involved the following. 96-well plate format was set up (36.5 |iil H2O, 
0.5 \xl pEPV3 forward primer (0.25 |ig/|al), 0.5 jil pEPV3 reverse primer (0.25 fig/^il), 1.5 
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fil each (6.0 total) dNTPs (10 mM), 0.5 |xl Vent™ DNA polymerase, 5 jal 10x Vent™ 

m * 

buffer, 1 ul of a 1:50 cell dilution, 50 ul total volume). 

pEPV3 forward primer: 5' CATC AAGCTTATCGATACCGTCG 3 ' (SEQ ID NO:437) 
p EPV3 reverse primer: 5' CACAGTAGTTCACCACCTTTTCCC 3' (SEQ ID NO:438) 



10 



Colonies of E. coli LE392 were picked onto a master plate- ,of LB + '13,ug/ml 
chloramphenicol (incubate throughout the day at 37° C) and then into 50 pi H2O which 
has been placed into a 96-well plate. 1 |il of this dilution was used in above PCR reaction 
(if the 96-well dilution plate is kept you will not need to prepare a master plate): Cultures 
for minipreps of plasmid candidates may be prepared directly from the cell dilutions. 



15 



20 



The PCR program was run (95 °C - 5 minutes, 30 Cycles of: 95 °C - 1 minute, 58 °C - 1 

! 1 

minute, 72 °C - 30 seconds, 72 °C - 10 minutes, 4 °C - hold). 

A 10 \iU reaction was run on a 1.0 % TBE gel. A gel designed for 96 well plates and a. 
multichannel pipettor were used to ease loading of the sample rows. The gel was run and 
stained with ethidium bromide. The positive clones were identified with appropriate 
molecular size insert(s), amplified by the flanking pEVP-3 primers. 

Minipreps Of Plasmids To Identify Cells Carrying The Pevp-3 Vector With An Insert 



The constructs that carried an insert were identified. The constructs having an insert 
were inoculated into a 5 ml LB/Cm culture, and incubated over night at 37 °C with 
25 aeration. Miniprep plasmid DNA was prepared by a standard procedure. The miniprep 
. DNA was digested with appropriate restriction enzymes to confirm the presence of the 
insert (enzymes flank Smal site in pEVP-3) (10 ul miniprep DNA, 2 ul 10 x buffer, 1 ul 
Xbal, 1 ul Xhol, 6 ul ddH20,' 20 ul total volume for digest). 
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To confirm the presence of an insert, the digest reactions were electrophpresed on an 
agarose gel and the gel was stained with ethidium bromide. The positive clones were 

used for the S. pneumoniae KNOjCKOUTs procedure. ' , 

* 1 ' i 

i ■ i 

5 The confirmatory PGR reactions, using knock out-specific primers (quality control step) 
involved 35.5 ul HfeO, 5 \il 10<x Vent™ buffer, 1 \il knockout forward primer (0.5 M-g/jil), 
1 |il knockout reverse primer (0.5 |ig/|il), 0.5 ^il Vent™ (6.0 jxl total) DNA Polymerase 
(2000 U/ml), 1 1.5 ill each, dNTPs (lOmM, 6.0 \i\ total), 1.0 ul miniprep DtiA from test 
clone, 50 |al total reaction volume. The PCR program was as follows: 95 °C for 5 
10 minutes, 30 Cycles of: 95 °C for 1 minute, 60 °C for .1 minute, 72 °C for 30 seconds, 72 

'it , 

°C for 1 0 minutes, hold at 4 °C. The presence of the correct-sized insert was confirmed 

* 

by agarose g^el electrophoresis and ethidium bromide staining. The confirmed clones 
were used for the S. pneumoniae gene KNOCKOUT procedure. Glycerol stocks were 
made of all positive E. coli LE392 constructs and frozen at - 80 degrees C. 

15 

EXAMPLE 3 

i 

The following provides a description of the high throughput gene disruption procedure 
used in S. pneunomiae strain (e.g., gene knockout procedure). The candidate ceg 
20 fragments that were generated by the method described in Example 2 were used in the 
gene disruption procedure in order to identify ceg nucleotide sequences that are required 
for cell viability. 

J 

Reactions were set up in a 1.5 ml eppendorf tubes or 96 well plate (1 jag total of miniprep 
25 pEVP-3 + insert DNA (usually 10 \s\ of Qiagen miniprep DNA); then 200 ^il of 5. 
pneumoniae (strain Rx-1) competent cells diluted 1:10 in competence media was added 
(1 ml of competence media = 980 jjtl Todd Hgjvitt (Difco Laboratories) with 0.5% yeast 
extract, 20 ^1 10% BSA, 1 nl 10 % CaC12, and 0.5 |al (200 |ig/ml) Csp-1 competence 
peptide). 

30 
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Controls were run with each KNOCKOUT experiment and involved 1 pEPV3 Lyt A 

construct = positive control (non-essential), or 1 |ig pEPV3 Fts Z construct = negative 

1 i i . . • < , .\ , , . . . 

control (essential). Then the 96 well plates and controls were incubated at 37 °C for 2.5 
to 3 hours in 37 °C room without shaking. The 200 |il of thfe samples were plated on 
5 Todd Hewitt agar plates with 0.5% yeast extract and 2 |ig/ml chloramphenicol. 

The samples were incubate over night at 37 °C in 5% CQ2 incubator/ Control plates were 
checkbd for presence of colonies (pEVP-3::lytA) and no growth (pEVPr3::ftsZ). Plates 
were examined for, growth (ca. 70-150 colonies) designating nonessentials and zero 
10 colonies designating essential genes. 

1 < 

1 

1 ♦ 

The polarity test, was performed in a similar manner, using the polarity fragments, 
described in Example 3 . , 

15 

EXAMPLE 4 

The following provides a description of the autolysin procedure used to determine that 
the non-essential control samples of S pneumoniae contain a disrupted lytA gene. r 

20 

Phenotypic Autolysin Test 

The culture plates containing transformants carrying the lytA control vector were flooded 
25 with 0.1% deoxycholate in H2O. The plates were observed after 5-10 minutes. Plates 
with "ghosts" indicated intact lytA gene, or plates without "ghosts" indicated a disrupted 
lytA gene. The "ghost" phenomenon is due to detergent triggered autolysis of the cells, 
causing a gradual fading of the colonies. 

30 The detergent treatment triggers the autolysin in lytA intact cells; it cannot trigger the 
autolysin (lytA gene product) in lytA disrupted cells. Colonies with intact lytA "ghost" in 
5-10 minutes due to massive pneumococcal cell lysis. 
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» 

■ i ( i 

EXAMPLES 

I ■ . . ' . 1 ' , 1 I 

The following provides a description of the procedure used tp express the CEG proteins 
5 (e.g., designated CFE proteins) in E. coli cells. 

■ i 

CEG Protein Production 

■ ■■' . , . 1 t 

i ♦ i 

Full-length ceg gene were inserted into pET-21 expression vector using the E. coli BL21 

i 

10 XDE3 expression system using the following method: 

For each ceg, custom primers were used to insert N- and C- termini into vectors such that 
the 5' end (N-terminus of the CEG) is positioned properly for expression behind the 17 
promoter and optimally placed with regard to the pET ribpsome binding site. The pET 

15 vectors contain an Ndel site which allows positioning of ATG start site in the vector. In 
cases where the ceg sequence contains an internal Ndel site, blunt ligation of the ceg PCR 
fragment into the vector is accomplished via Klenow fill-in of the Ndel site. In many 
cases, primers were also designed such that the ceg 3 9 (C-terminus of the expressed 
protein) will contain an in-frame extension of 6X-histidine residues, encoded in the 

20 vector sequence of pET-21. The individual cegs were PCR amplified via custom 

+ 

designed primers as described above. Both ceg PCR and vector DNA were digested with 
appropriate restriction enzymes. The full-length ceg were ligated into the pET 

m 

expression vector. The ligation mixture was transformed into competant E. coli BL21 
^DE3 cells and selected for transformants on LB agar with 50 jag/ml ampicillin. Positive 
25 insert bearing clones were screened via minipreps of the plasmids and size analysis on 
0.8% agarose gels, with detection by ethidium bromide staining, as above. 

Protein Production 

30 The proper reading frame of each ceg inserted into pET-21 is verified by DNA 
sequencing. 

* 
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4.1 

; ■ ■ 

A small (2-5 ml) test culture of E. coli BL21 X.DE3 with the insert-bearing plasmid is 
' tested for protein expression by IPTG induction of the expression vector for, 1-2 hours. 
The expression is verified by SDS-Pplyacrylamide Gel Electrophoresis analysis of a • 
5 whole cell extract (SDS extract of 0.5-1 ml of cells treated at 100 °C for 5 minutes) to 
determine whether the protein is over-expressed and migrates at the correct predicted 
molecular weight. ■ ■ . 

I 

The protein is overproduced and purified, via the following method. A large scale (500- 
10 1000ml) culture of K coli is grown to early logarithmic phase in broth (e.g., LB broth) 
and protein expression induced for 2 hours with IPTG (isopropyl-D-thiogalactoside). 
The cells are harvested by centrifugatibn (8000 X G; 15 minutes) and the cell pellets 

resuspended in 20 ml. of buffer. The cells are lysed by sonication, and the supernatant 

"I 

fluid centrifuged at low speed (5000. X G, 15 min.) to remove unbroken cells. The 
15 supernatant fluid, containing the over-expressed protein is subjected to Ni- NTA affinity 
column chromatography (Quiagen, Inc., Chatsworth, CA). The 6X-histidine residues 
linked at the C-terminal end of the CEG proteins permit rapid protein purification via 
selective binding to a Ni-NTA resin column. The protein-bound, Ni-NTA resin was to 
remove contaminants, and the bound proteins subsequently eluted with imidazole and 
20 recovered. It is possible to upscale this procedure to larger volumes for higher yields of 
proteins. 

EXAMPLE 6 

25 The following provides a description of the methods used to purify all 2CEG 
polypeptides (e.g., 2CFE polypeptides #19-117; SEQ ID NOS:349-436) having a 
histidine tag at their G-terminal ends. The 2CEG polypeptides having the his-tags were 
produced by the methods described in Example 5, supra. As an example, results of 
purification of 2CFE 75 polypeptide are presented. 

30 
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i 

Production OfThe CFE Polypeptides , 

i . ■ , 

• t 

t 

. i 

The BL21XDE3 cells harboring recombinant pET-21 vectors carrying a 2CFE nucleotide 

' i 

sequence (SEQ ED NOS:244-331) were . cultured in LB broth containing ampicillin. 
5 When the A^oo reached approximately 0.6, protein production was induced by adding 1.0 

% 

* * i * 

mM of IPTG, the cells were cultured for an additional 2 hours. The cell pellet was 
collected by centrifugation; and the collected cell pellet was sonicated in Solution A (50 
mM NaP0 4 ; 300 mM NaCl, pH 8.0). '"The sonicated cells were centrifuged at 10,000 
RPM to remove the debris. ... 

10 . 

Purification OfThe CFE Polypeptide 

The supernatant was diluted with Solution A, loaded onto a Ni-NTA column (Quiagen) 

j * ' i 

equilibrated with Solution A; the column bed size was 2.5 x 25 cm, and the flow rate was 

i 

15 approximately 3.0 ml/minute. The 2CFE protein was eluted using a linear gradient of 
imidazole, using 0-250 mM in 450 ml, flow rate approximately 3.0 ml/minute. The 
eluted samples were collected as 22 ml fractions per tube and the eluted samples were 
monitored using spectrophotometry. The amount of protein in the eluted fractions was 
estimated using the Bradford method (Bradford, M. M., 1976 Anal Biochem. 72:248) and 

20 the samples were run on an SDS-PAGE gel (Novex EC6008) (Figure 3 A). Fractions 
were selected for pooling based on the results of the SDS-PAGE gel. The pooled 
fractions were concentrated using a 10,000 MW Centricon (Amicon) to approximately 5 
ml. 

25 The 2CFE 75 polypeptide, a precipitate formed and was redissolved upon increasing the 
sample volume and removing the imidazole by repeated concentration in 50 mM Tris, 
100 mM NaCl, pH 7.5. Varying amounts of the 2CFE 75 polypeptide were diluted in 
either 20 mM Tris, 20 mM KC1, pH 7.5 or 20 mM Tris, 20 mM MgCl 2 , pH 7.5 at 
* concentrations of 12, 24, or 36 ug/mL The diluted samples were electrophoresed on an 

30 SDS-PAGE gel under non-reducing conditions (Figure 3 B). The results of Figure 3 B 
suggests that 2CFE 75 forms a multimer. 
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EXAMPLE 7 

• t • • , 

i < j • 

. , ( i , ' ( 

The following provides a description of the methods used to purify CEG polypeptides 
5 that lack a histidine tag (e.g., 2CFE polypeptides #1-17; SEQ ID NOS:332-348). As an 

example, the results of purification of CFE 3 .polypeptide are presented. 

' , ' t . ■ , - • 

Purification of the CFE 3 Polypeptide ' 1 , 

■ .■ * 1 

10 The 2CFE 3 polypeptide was produced using the large scale IPTG-induced method 
described in Example 5, supra. The 2CFE 3 (SEQ ID NO:334) polypeptide lacks a C- 
terminal histidine tag. The 2CFE 3 polypeptide was purified using a 2-column 
procedure. The 2CFE 3 polypeptide preparation was eluted from a 26/10 Q Sepharose 
column (Pharmacia) , using a 0-1.0 M NaCl gradient, 2 ml/minute flow rate, and the 

1 5 gradient size was 1 liter. Then the 2CFE 3 polypeptide was eluted from a hydroxyapatite 
Bio-gel column (Bio-Rad) using a 5-200 mM potassium phosphate (pH 8.0) gradient, the 
flow rate was 0.3 ml/minute, and the gradient size was 300 ml. A sample of the 2CFE 3 
preparation was run on a polyacrylamide gel (Figure 4). 

20 EXAMPLE 8 

The following provides a description of the size exclusion chromatography methods used 
to estimate the molecular weight and determine whether the CEG polypeptides 
oligomerize. The CFE polypeptide may olimerize to form monomers, dimers, tetramers, 
25 hexameric rings, or other oligomeric forms. 

Size exclusion chromatography was performed on all isolated 2CFE polypeptides #s 1- 
117 (e.g., SEQ ID NOS:332-436). This method was performed using various types of 
columns, depending on the particular 2CFE polypepeptide tested. 

30 
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The Biosil SEC- 125 HPLC Gel Filtration column (BioRad Laboratories, Inc) was used, 

* i i 

for example, to characterize CFE 8. The mobile phase was 0.2 M KH2PO4, 0.9% NaCl 

> pH 6.8. • ' . ' '; '' ' 1 ' 

5 The Phenomenex 600 x 7.5 mm Biosep SECS 3000 column was used, for example to 
characterize 2CFE 21 and 39. This mobile phase for size exclusion was 50 raM 
Na 2 HP0 4 , pH 7.0 and 150 mM NaCl run at 1 ml/minute in a Gilson HPLC system, with 
protein detection at 280 nm. 

, ■ • " i 

i 

i 

10 EXAMPLE 9 j • i . 

i 

The following provides a description of the computer-aided methods used to search for 

i ; 

similarities between the amino acid sequences of the CEG polypeptides and sequences 

i 

available through public and proprietary databases. In many cases, the function of the 

! i 

15 CEG polypeptides was suggested by the results of the similarity searches. The function 
of some of these CEG polypeptides has been confirmed by performing additional 
analyses. Table V provides a list of the suggested and confirmed . functions of CEG 
polypeptides designated CFEs #1-117. 

20 The- putative function of the CFE polypeptides were determined using computer-aided 
bioinformatic approaches, including distant homologies, motif searching, or predictions 
based on statistical rules. For example, the distant homology approach involved pairwise or 
multiple sequence alignments, employing tools such as FASTA, and Psi-BLAST. The 
motif searching approach involved using sophisticated hidden Markov models. The 

25 approach based upon predictions of statistical rules involved prediction of transmembrane 
regions, coiled-coil, and other structural motifs. These approaches have been reviewed in 
Computational Methods In Molecular Biology 1998, eds. Salxber, S.L., Searls, D.B. Searls, 
and Kasif, S. , Elsevier, and in Bioinformatics: A Practical Guide To The Analysis Of Genes 
And Proteins 1998 eds Baxevanis, A. D. and Francis Ouellete, B.F. , Wiley-Interscience. 

30 
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Global sequence similarity searches were performed using the amino acid sequences of 

all the conserved essential gene sequences (e.g., CFEs 1-117; SEQ ID NOS:l 14-226) to 

•I , i i • ■ . i .- - < ' 

search against a non-redundant protein database using the BLAST2 algorithm (Altschul 
S.F., et al., 1997 "Nucleic Acids Res. 25(17):3389-3402). In a similar search, similar 

5 sequences were identified in the Concordance database using the "Neighbor" function 

' ■■ • 

(Bruccoleri R. E., Dougherty T. J., Davison D.B. 1998 Nucleic Acids 7tes.'26(19):4482- 
4486). To determine if the predicted amino acid sequences were full, length and in the 
proper' readirig' frame, BLAST-type searching and CLUSTAL multiple sequence 
alignments (Higgins. b.G., et al., 1996 Methods Enzymot 266:383-402) were used. 
10 Local sequence similarity searches were performed, by searching for Prosite (Hofmann 
K., et al., 1999 Nucleic Acids Res. 27(1):215-219) and Pfam motifs (Bateman A., et al., 
2000' Nuclei^ Acids Res. 28(l):263-266). Additionally, the amino acid sequences of the 
CFEs were analyzed by performing protein threading analyses using the ProCeryon fold 
recognition program (Sippl, et al., 1992 Proteins 13:258-271; Sippl, J. 1993 J. Comp. 

15 Aided Mot Design 7:473-501 ; www.proceryon.com) and Geneformatics. 

i 

In bacteria, many operons include genes encoding different proteins that catalyze discrete 
steps of a common biochemical pathway. Therefore, the operon structures in S. 
pneumoniae was compared with that in other bacteria in order to predict the function of 
20 CFE polypeptides. 

Additionally, analysis of bacterial metabolic pathways were performed using Pathway 
Tools from DoubleTwist, based on the EcoCyc system (Karp P.D., et al., 1999 Nucleic 
Acids Res. 1999 27(l):55-58). This analysis was used to predict which CFEs mediate 
25 various steps of the pathways. 

When the sequence identity between a CFE polypeptide and the annotated database (e.g., 
SwissProt, Genbank) was low (e.g., sequence identity less than about 30%), a Protein 
Threading (e.g., fold recognition) method was used to predict similarities in the folded 
30 protein structure of CFE polypeptides in the absence of a high level of sequence similarity 
"with proteins in the databases (review by Teichmann, et al., 1999 Current Opinion in 
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Structural Biology 9:390-399). The Protein Threading method predicts the compatibility of 

a query sequence (e.g., CFE polypeptide sequences) with each of the folds in a library of 
, known protein structures. The library of known protein' structures as developed, maintained^ 
and updated throughout the search process. ' , 

5 

A list of potential structural folds, onto which each query was compatible, was, generated for 

all CFE polypeptides (e.g, SEQ ID NOS:l 14-226). The fold assignments for each query 

were used to generate pairwise sequence alignments. The pairwise sequence alignments ' 

i * 
were used to generate protein models of the query polypeptide (e.g., CFE polypeptides). 

10 i ' • 

The pairwise sequence alignments were also used to compare the position of critical 

residues of the structural template with the query polypeptide. The list of critical residues 

I t 

was generated by using multiple sequence alignments derived from a structural 

i 

classification of proteins to generate ja conservation profile which provided sequence- 
15 specific positions conserved across a homologous family of protein folds. Comparative 
modeling was used to search the model of the query polypeptide for the critical residues and 
determine whether the structural and functional motifs are conserved in the query protein. 
Conservation of structural and functional motifs permitted assignment of putative structure 
and function to a query polypeptide sequence. 

20 

The Protein Threading method was used to search for putative folded structure and function 
for all CFE polypeptides (SEQ ID NOS:l 14-226). The CFE polypeptides having significant 
sequence identity (e.g., more than 30%) to known proteins were assigned putative functions 
with a high level of confidence. 

25 

EXAMPLE 10 

The following provides a description of the methods used to characterize purified, CFE 
101 polypeptide. The 2CFE ; 101 polypeptide mediates the conversion of pantothenate to 
30 4' phosphophantothenate, and is predicted to be a pantothenate kinase. 
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Computer-Aided Comparison 

■ 

The computer-aided comparison, as described in Example 9, supra, suggests' that' the 
amino acid sequence of the CFE.101 polypeptide (SEQ ID' NO:210) is 42% similar to the 
amino acid sequence of the coaA protein of E. coli Thus, CFE 101 may be a 
pantothenate kinase, which mediates the conversion of pantothenate to 4' 
phosphophantothenate (Figure 5). 



Circular Dichroism and Circular Dichroism Thermal Melt Analysis 

j 

Circular dichroism and circular dichroism melt methods were used to determine the 
folded structure of the expressed and isolated 2CFE polypeptides. For example, this 
method was used to characterize the folded structure of isolated 2CFE 101 (SEQ ID 
NO:421). | 1 



The starting concentration of the 2CFE 101 polypeptide was such that OD205 was 
approximately 1.5, and the OD280 was approximately 0.05 (e.g., 0.05 to 0.1 mg/ml). The 
starting concentration of 2CFE 101 was approximately 344 jaM in 50% glycerol, 50 mM 
Tris, 100 mM NaCl, 5 mM MgCl 2 , 0.5. mM EDTA, at pH 7.5. The polypeptide was 
20 diluted to a final concentration of 7 jjM, as determined by absorbance at A28o> in 20 mM 

* 

Na-phosphate, 100 mM KC1, at pH 7.0. The circular dichroism analysis was performed 
using quartz cuvettes, the instrumentation was from JASCO (Model J-720), the readings 
were performed at 25 degrees C (Figure 6 A). The band width was 1 nm, the sensitivity 
was 20 mdeg, the response was 0.25 seconds, the scan speed was 50 nm/minute, and the 
25 step was 0.5. The circular dichroism thermal melt analysis was performed at a range of 
between 0 and 100 degrees C (Figure 6 B). Additionally, the circular dichroism was 
performed comparing monomer and aggregate pools of 2CFE 101. 
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Size Exclusion Analyses , 

I • i i m 

1 

• i ■ , , .| .. .... 

Size exclusion chromatography methods were performed using the Biosil SEC column, 
as described in Example 8 supra. The results suggest that the 2CFE 101 polypeptide 
5 forms monomer (40,200 Da) and oligomers (194,000 Da). The specific activity of the 

monomer and oligomeric forms of 2CFE 101 were determined, as described below. 

i ' . 

r ■ 1 ' ' , I 

»i 

i 

Biochemical Assays ■ • 1 , 

i * 1 ♦ 

i 

10 The biochemical assays of the 2CFE 101 polypeptide.was based on the PK/LDH coupled 
enzyme assays described by Vallari, D. S., e!t al. (1987 J. Biol Chem. 262:2468-2471) 
and Song, W. -J., et al., (1994 J. Biol Chem. 269:27051-27058). 

Briefly, the assay was performed as follows. The reaction included; 885 jxl of 0.1 M 
15 Tris-HCl (pH 7.6), 25 \A NADH (14.1 mM), 20 fxl ATP (10.7 mM), 50 ^1 phospho-enol- 
pyruvate (56 mM), 5 jal LDH/PK (lactose dehydrogenase/PK; Sigma, catalog # P-0294, 
60 U/ ml PK, 1050 U/ml LDH), 5 jil of the 2CFE 101 polypeptide (9 mg/ml in 50 mM 
Tris-HCl, pH 7.5, 100 mM NaCl which was diluted to 4.5 mg/ml in 50% glycerol). The 

* 

reaction was started by adding 10 jlxI pantothenate (100 mM; Sigma, catalog # P2250). 
20 The production of ADP in the reaction was monitored by measuring the absorbance a 340 
nm. The results in Figure 8 show that the. 2CFE 101 polypeptide mediates ADP 
production in the presence of pantothenate and ATP. The K m of pantothenate (n=4) was 
144 (±16.5) jjM, the V m ax of the 2CFE 101 polypeptide (n=4) was 2.04 (±0.25) |aM min" 1 
mg" 1 . The. monomer form has a specific activity of approximately 1.7 jxM min' 1 mg" 1 . 

* 

25 The oligomeric form has a specific activity of 0.26 jliM min" 1 mg" 1 . 

Alternatively, the 2CFE 101 polypeptide can be tested in an assay that monitors the 
conversion of pantothenate to 4'-phosphopantothenate. The same reaction described 
above can be used, except 14 C-labeled pantothenate is used. The. reaction can be 
30 monitored by measuring the amount of 14 C-labeled 4'-phosphopantothanate produced. 
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EXAMPLE 11 

i 

The following provides a description of the methods used to. characterise purified, CFE 

■ ■ i 

39 and CFE 21 polypeptides, carrying a C-terminal histidine 6-tag, The methods include 
5 helicase reactions, in which synthetic Holliday Junction templates are resolved into 
duplex structures. In one method, helicase reaction was monitored using radiolabeled 

templates. In another method, the helicase , assay, was adapted, for use in a high 

1 . • i , . • 

throughput assay employing fluorescence labeled, templates. 

i 

i i 

i .1 r 

10 Computer- Aided Comparison 

' , 1 ■ 

i I , 

* t 

The computer-aided comparison, as described in Example 9 sugrd, suggests that the CFE 
39 polypeptide (SEQ ID NO: 148) is an RuvA homologue. The comparison also 
suggests that CFE 21 (SEQ ID NO: 132) is an RuvB homologue. 

15 

Previous studies by Parsons and others have shown that RuvA and RuvB proteins, in K 
coli, promote branch migration or movement of Holliday Junctions during genetic 
recombination and DNA repair (Parsons, C. A., et al., 1992 Proa Natl., Acad. Sci. USA 
89:5452-5456; Tsaneva, I. -R., et al., 1993 Proc. Natl, Acad. Sci. USA 90:1315-1319; 
20 Muller, B., et al., 1993 J. Biol. Chem. 268:17179-17184; Mitchell, A. H. and S. C. West 
1996 J. Biol- Chem. 271:19497-19502; Parsons, C. A. and S. C. West 1993 J. Molec. 
Biol. 232:397-405; Tsaneva, I. R., et al., 1992 Molec. Gen. Genet. 235:1-10; Mitchell, A. 
H. and S. C. West 1994 J. Molec. Biol. 1994 243:208-215). 

25 Size Exclusion Chromatography 

Size exclusion chromatography was performed on 2CFE 39 (SEQ ID NO:366) and 2CFE 
21 (SEQ' ID NO:350) using the Phenomenex 600 x 7.5 mm Biosep SECS 3000 column, 
. as described in Example 8 supra. Protein standards (BioRad) were used to calibrate the 
30 column, including thyroglobulin (670,000 Da), gamma globulin (158,000 Da), ovalbumin 
(44,00 Da), myoglobin (17,00 Da), and B-12 (1350 Da). 

86 



NSDOCtD:<WO 0149721 A2 I > 



WO 01/49721 



PCT/USG9/35604 



10 



15 



20 



The results indicate that 2CFE 39 (RuvA) forms tetramers and 2CFE 21 (RuvB) forms a 
hexameric ring structure. Selected eluted samples were', electrophpresed on a 

polyacrylamide gel (Novagen) (Figure 9). 1 ' 1 , ' . 

i ■ 

The Holliday Junction Analysis Using Radiolabeled Templates , 

1 i _ 

The Holliday Junction analysis ( was performed using radiolabeled, synthetic, 
asymmetrical, Holliday Junction templates, as described in Hiom, K. and S. C. West 
1995 Cell 80:787-793. The Holliday Junction templates were produced by 1 Unhealing 
together four separate, single-stranded, oligonucleotide strands to form four-stranded 

structures (e.g., the Holliday Junction template). The Holliday Junction templates were 

i 

reacted with the 2CFE 39 and 2CFE 21 polypeptides, in a helicase reaction, to test their 
ability to generate two duplex structures. « 

Producing the Synthetic Holliday Junction Templates 

The asymmetrical Holliday Junction templates were produced by annealing the following 
oligonucleotide sequences: 

Oligonucleotide strand 1 : 

5 5 -CCAGTGATC ACATACGCTTTGCTAGGACATCTTGATATC AGCCC ACGTT 
CACCCGCCTACCAGTGCCACGTTGTATGCCCACGTTGACC-3' (SEQ ID NO:4*38) 



25 Oligonucleotide strand 2: 

5'-GGGTCAACGTGGGCATACAACGTGGCACTGGTAGGCGGGTGAACGTGGG 
CTGATATC AAGATGTCC ATCTGTCCGTTC ATCTATGACGT-3 ' (SEQ ID NO :43 9) 

Oligonucleotide strand 3 : 
30 5 '-AACGTCATAGATGAACGGACAGATCATGGTGCTTTTAAAGTCTAGAGAC 
TATCGAGCATTAGTACCAGTATCGAATCCGTCTTGTCAA-3' (SEQ ID NO:440) 
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Oligonucleotide strand 4: 




TTTAAAAGCACCATGTAGCAAAGCGTATGTGATC ACTG-3 ' (SEQ ID NO:44 1) 

5 

t i ■ ' 

Oligonucleotide strand 3 was labeled at the 5' end using approximately 300 ng of 
oligonucleotide strand 3, 1 \il lOx Phosphate Buffer, 5 pi 32 P ATP, 1 pi T4 polynuclotide 

kinase (Gibco-BRL)), in a 10 pi volume, and the reaction was performed at 37 degrees C 

1 I 1 

for 30 minutes. The reaction was loaded onto a G50 column to remove the 

10 unincorporated radiolabel. The final concentration of the radiolabeled oligonucleotide 

strand 3 was approximately 1 5 ng per pi. 

t 

Approximately equimolar amounts of the four oligonucleotide strands were annealed 
• (e T g., hybridized). The annealing reaction included: ,5 pi Annealing Buffer (200 mM 

15 Tris-Cl pH 8.0, 100 mM MgCl 2 , 1 M NaCl, 10 mM DTT); 450 ng of radiolabeled 
oligonucleotide strand 3; and 1000 ng each of oligonucleotide strands 1, 2, and 4; in 50 pi 
total reaction volume. The control annealing reaction included: 5 pi Annealing Buffer, 
60 ng radiolabeled oligonucleotide strand 3; 1000 ng oligonucleotide strand 4; in 50 pi 
total reaction volume. Annealing was performed at 95 degrees C for 5 minutes, 65 

20 degrees C for 30 minutes, 42 degrees C for 30 minutes, and room temperature (e.g., 
between about 23 to 27 degrees C) for 30 minutes to generate the synthetic Holliday 
Junction templates. The synthetic Holliday Junction templates were gel or column- 
purified to remove the duplex and non-annealed products. As a control, oligonucleotide 
strands 3 and 4 were annealed to form duplex structures. The synthetic Holliday Junction 

25 templates and duplex structures were stored at -20 degrees C. 

CFE 39 and CFE 21 : The Helicase Reaction Using Radiolabeled Templates 

The helicase reaction was performed to determine whether 2CFE 39 and 2CFE 21 
30 resolved the synthetic Holliday Junction templates into duplex structures. The helicase 
reaction was performed as follows. A 50 pi total reaction volume included: 25 jml of 2x 
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Reaction Buffer (50 mM Tris-Cl pH8.0, 30 mM MgCl 2 , 2 mM ATP); 1 jil synthetic 
Holliday Junction template (36 ng); 2 jal 2CFE 39 (1 uM); and 2 jal 2CFE 21 (1 uM). 

The reaction was incubated at 37, degrees for 30 minutes. The reaction was stopped by 
adding 5 ul Stop Buffer (100 mM Tris-Cl pH 7.5, 5 mg/ml Proteinase-^ 5% SDS). The 
5 stopped reaction was returned to 37 degrees C for 5 minutes. The helicase reaction was 

loaded onto and run on a non-denaturing, 12% PAGE, Tris-glycine gel. 

■ ; ' *■ , . ■ . 1 

The results shown in Figure 10, lanes 6, 7 and 8, indicate that the 2CFE 39 and 2CFE 21 
polypeptides resolved the synthetic Holliday Junction templates into duplex structures. 

I 

10 ' . 

! 

CFE 39: The Helicase Reaction 

■ 

It has been previously shown that E. coli RuvA binds to Holliday Junction templates 

E 

(Parsons, C. A., ef al., 1992 Proc. Nail, Acad Set USA 89:5452-5456). the ability of S. 

15 pneumoniae CFE 39 to bind to a Holliday Junction template can be tested by employing 
the helicase assay described herein. The results of the helicase assay can be monitored by 
performing a gel shift assay and/or capillary electrophoresis. The presence of a Holliday 
Junction template bound to 2CFE 39, which migrates more slowly than the Holliday 
Junction template alone, would indicate that S. pneumoniae 2CFE 39 binds to Holliday 

20 Junction templates. 

CFE 39 and CFE 21 : Holliday Junction Analysis Using Fluorescent-Labeled Templates 

The helicase reaction described herein was performed using Holliday Junction templates 
25 having one oligonucleotide strand labeled with a fluorescent agent and another strand 
labeled with a quenching agent. The 5' fluorescent end and the 3' quenching end of the 
strands that make up the Holliday Junction templates are in proximity to each other, 
resulting ,in a non-fluorescent template. When the Holliday Junction templates are 
resolved into duplex structures, the fluorescent and quench ends are not in proximity to 

* 

* 

30 each other, resulting in fluorescence. 
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The Holliday Junction templates used to perform this experiment comprised the 
following: the 5' end of oligonucleotide strand 1 was labeled with a fluorescein (e.g., the 
• fluorescent, agent), and the 3* end of oligonucleotide strand 4 \yas labeled with DABCYL' 
(e.g., the quenching agent). The oligonucleotide strand 1 labeled with fluorescein' and the 
5 oligonucleotide strand 4 labeled with DABCYL were custom synthesized (Gibco-BRL 
Life Technologies, Inc.). , . . ' , 

The fluorescein and DABCYL labled oligonucleotides were annealed in a reaction, as 
described above, to generate synthetic Holliday Junction templates. The helicase reaction 
10 was performed as described above. The results of the helicase reaction were monitored 

by measuring the unquenching of the Holliday Junction templates with time (Figure 1 1). 

t * 

I i 

The helicase assay using Holliday Junction templates labeled with fluorescent-quenching 
agents can be adapted for use in high throughput analyses to test 2CFE 39, 2CFE 21, and 
1 5 other polypeptides for their ability to resolve the templates into duplex structures. 

EXAMPLE 12 

m 

The following provides a description of the methods used to characterize purified, CFE 8 
20 polypeptide, which lacks a histidine tag. The CFE 8 is a putative DNA single-stranded 
binding protein. 

Computer- Aided Comparison 

25 The computer-aided comparison, as described in Example. 9 supra, suggests that the CFE 
8 polypeptide (SEQ ID NO: 121) may be a single stand binding protein homologue, such 
as SSB. 
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Size Exclusion Chromatography 

• ... <. 

The 2CFE 8 polypeptide (SEQ ID NO;339) was characterized by size exclusion • 
chromatography, using the Biosil SEC-125 HPLC Gel Filtration column 1 as described in 
5 Example 8 supra. The chromatogram showed one peak corresponding to a molecular 
weight of approximately 89,kDa. Based on the nucleotide sequence, the predicted 

molecular weight of 2CFE 8 is 17,351 Da. In non-denaturing conditions, 2CFE 8 forms a 

i * 

multimef. . , , 1 1 

i i i , 

i 1 

10 Binding Reaction • 

i * i * ■ 

The 2CFE 8 polypeptide was reacted with a single-stranded oligonucleotide A, Briefly, 
the binding reaction included: 50 \iM of 2CFE 8 polypeptide, 50 pM oligo strand A, 20 

mM Tris/20 mM KC1 pH 7.5. The binding reaction was performed at 37' degrees C, for 2 

i 

15 hours. 

t 

i 

Oligonucleotide strand A: 

5 ' -TTAGGGCCCGGGCTATCTTAC AATCTCGTT-3 * (SEQ ID NO:442) 
20 Capillary Electrophoresis 

The results of the binding reaction was monitored by capillary electrophoresis, following 
the methods described in "Handbook of Capillary Electrophoresis" 2 nd Edition, 1997, ed. 
J. Landers. 

Separation was performed using an uncoated capillary tube (360 |nm o.d., 50 |xm i.d., 
with a 50 cm effective separation length; Watrex International, Inc., Pittsford, NY) and 
50 mM borate pH 9.3 as the mobile phase, at 25 kVolts, 20 minutes separation time. 

The results indicate that 2CFE 8 alone elutes as a sharp peak, indicating little adsorption 
to the uncoated capillary wall (Figure 12 A). The shape of the peak and peak retention 
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time changed with 2CFE 8 in the presence of all oligonucleotides tested (Figure 12 B). 
As a negative control, MurB polypeptide (Pucci, M. J., L. F. . Discotto, and T. J. 
Dougherty 1992 "Cloning and Identification of the Escherichia coli murB DNA 
sequence, which' encodes UDP-N-acetylenolpyrtivoylglucosainine reductase" J. 
5 Bacterial 174:1690-1693) was reacted with the, same oligonucleotides. MurB reacted 

with or with out the oligonucleotides showed no change in peak shape or retention time. 

ii . • ■ i ■ 

, . ■ • . - i 

: . 1 ( ' 

1 I | ■ . ■ 

After capillary electrophoresis analyses, the 2CFE8 alone and 2CFE plus oligonucleotide 
samples were run on 1 native polyacrylamide gels to determine whether the polypeptide 
10 was intact. The results indicate that in all cases, 2CFE 8 was intact and had not degraded 
with time or storage. 

i ' i 1 
Mobility Shift Assays 

1 ' i 

15 The ability of 2CFE 8 polypeptide to bind oligonucleotide strand A was tested in a 
mobility shift assay. 

* 

The results indicate that 2CFE 8 binds single stranded oligonucleotides (Figure 13 A and 
B). In Figure 13 A, the gel was stained with ethidium bromide. The unbound 
20 oligonucleotides appear near the bottom of the gel, while the bound oligonucleotides 
appear near the middle. The same gel was stained with Coomassie (Figure 13 B), 
revealing that 2CFE 8 polypeptide bound -to the oligonucleotide migrated further than 
unbound 2CFE 8, due to the change in charge carried by the oligonucleotide. Various 
ratios of 2CFE8:oligo were tested. The optimal binding ratio was 2:1. 

25 

The Effect ofMgCl 2 

The 2CFE 8 polypeptide precipitated in the presence of 5 mM MgC^. The precipitation 
was reversible by the addition of 1 pM of the oligonucleotides tested. The observation 
30 indicates specific binding between 2CFE 8 polypeptide and the oligonucleotides tested. 
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Scintillation Proximity Assay I 

Scintillation proximity, assay (SPA) methods can be used in ; a high throughput screening 
procedure to monitor, for example, a binding reaction. SPA utilizes beads (Amersham) 
5 which are coated on the surface with a particular compound or molecule. For example, 

the SPA bead may be coated with avidin to facilitate binding with any molecule having a 

« i 

biotin tag. 

■■i 

The binding reaction of the 2CFE 8 polypeptide and the oligonucleotide strand A can be 
10 monitored using SPA beads and a scintillation counter. The beads can be cpated with 
avidin, the 2CFE 8 polypeptide can be tagged with biotin, and the oligonucleotide strand 
A can be radiolabeled. 



15 



EXAMPLE 13 

i 
i 

i 

The following provides a description of the methods used to characterize purified, 2CFE 
3 (SEQ ID NO:334) and 2CFE 86 (SEQ ID NO:409) polypeptides. 

* » 

The 2CFE 3 polypeptide catalyzes the conversion of D-glucosamine-6-phosphate to D- 
20 glucosamine- 1 -phosphate, indicating that 2CFE 3 mediates amino-sugar biosynthesis 
through the N-acetyl glucosamine pathway (Figure 14). 

The 2CFE 86 polypeptide catalyzes the conversion of D-glucosamine-1 -phosphate to N- 
acetylglucosamine-1 -phosphate, and the conversion of N-acetylglucosamine-1 -phosphate 
25 to UDP-N-acetylglucosamine-1 -phosphate, which indicates that 2CFE 86 also mediates 
amino-sugar biosynthesis through the N-acetyl glucosamine pathway (Figure 14). 

Computer-Aided Comparisons Of CFE 3 

30 The computer-aided comparison, as described in Example 9 supra, suggested that the 
CFE 3 polypeptide (SEQ ID NO: 1 16) is a phosphoglucosamine mutase, such as GlmM. 
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i 

Purification of the CFE 3 Polypeptide 1 

, The: 2 CFE 3 polypeptide was produced, using the large scale. EPTG-induced method 
described in Example 5, supra. The 2 CFE 3 polypeptide lacks a C-terminal hi^idine tag. 
5 The 2CFE 3 polypeptide was purified using a 2-coIumn procedure. The 2CFE 3 
polypeptide preparation was eluted from a 26/10 Q Sepharose column (Pharmacia) using 
a 0-1.0 M NaCl gradient, 2 ml/minute flow rate, and the gradient size was 1 liter. Then 
the 2CFE 3 polypeptide was eluted from a hydroxyapatite Bio-gel column (Bio-Rad) 
using a, 5-200 mM potassium phosphate (pH 8.0) gradient, the flow rate was 0.3 
10 ml/minute, and the gradient size was 300 ml. A sample of the 2CFE 3 preparation was 
electrophoresed on an SDS polyacrylamide gel (Figure 4). 

t i 

Affinity Capillary Electrophoresis of CFE 3 

i 

! > 

1 5 Affinity capillary electrophoresis methods were used to determine whether the 2CFE 3 

i 

polypeptide binds to various glucose derivatives. Binding was performed under 
equilibrium conditions, in which the sugars were dissolved in the running buffer and 
reacts with 2CFE 3 during separation in the column. The affinity capillary 
electrophoresis method used to analyze 2CFE 3 follows the methods described in 
20 "Handbook of Capillary Electrophoresis" 2 nd Edition, 1997, ed. J. Landers. 

i 

Briefly, 2CFE 3 polypeptide was reacted with increasing amounts of various glucose 
derivatives (e.g., substrate) at 25, 30 and 37 degrees C. .The glucose derivatives included 
UDP-glucose, glucose- 1 -phosphate, glucose-6-phosphate, glucosamine- 1 -phosphate, and 
25 glucosamine-6-phosphate. The reaction included: 2CFE 3 polypeptide (2.0 mg/ml), 
separation buffer (25 mM Tris; 192 mM Glycine, pH 8.0; BupH Tris-Glycine Buffer 
Packs, Pierce). Separation was performed at 25 kVolts, separation time was 1 5 or 20 

■ 

minutes. 



30 



The results shown in Figure 15 A indicate that at 25 degrees C, 2 CFE 3 binds to D- 
glucose-1 -phosphate in a dose-dependent manner, as the peak shape and/or the retention 
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time for 2CFE 3 changes in the presence of 100 and 500 pM D-glucoserl -phosphate 
compared to unreacted 2CFE 3. 

The results shown in Figure 15 B indicate that at 25 degrees, C, 2CFE 3 binds to D- 
5 glucosamine-6-phosphate in a dose-dependent manner, as the peak, shape and/or the 
retention time for 2CFE 3 changes in the presence of 100 and 500 |uM D-glucosamine-6- 
phosphate compared to unreacted 2CFE 3 . ' • 

The results shown in Figure 15 C indicate that at 25 degrees C,. the 2 CFE 3 polypeptide 
10 also binds to glucose-6-phosphate. 1 

1 » 1 

A comparison of 2CFE 3 reacted with various glucose derivatives, at 30 degrees C, is 

shown in Figure 1 5 D. The results indicate that D-glucosamine-6-phosphate is a putative 

substrate for 2CFE 3, as this reaction exhibits the greatest change in peak shape and/or 

i 

15 retention time. 

CFE 3: Capillary Electrophoresis and Laser-Induced Fluorescence 

In a further analysis of 2CFE 3 polypeptide, capillary electrophoresis was performed with 
20 laser-induced fluorescence in order to separate and detect interaction between the 
substrate (e.g., D-glucosamine-6-phosphate) and the product (e.g., D-glucosamine-1- 
phosphate) in a one dose, one time-point procedure. 

The 2CFE 3 polypeptide was derivitized by reacting 10 mM FITC (fluorescein 
* 25 isothiocyanate dissolved in methanol; Calbiochem, San Diego, CA) with D-glucosamine- 
6-phosphate, at ambient temperature, in the dark, overnight. The FITC-derivatized 2CFE 
3 polypeptide (2.0 mg/ml) was reacted with the substrate (D-glucosamine-6-phosphate 
and D-glucosamine-1 -phosphate) for one hour. 

30 Separation was performed using an uncoated capillary (360 jim o.d., 50 pm i.d., with a 
• 50 cm effective separation length) and 50 mM borate (pH 9.3) as the mobile phase. The 
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argon-ion laser had an excitation wavelength of 488 nm and an emission filter of 520 nm 

i i • ■ * 

(Beckman, Fullerton, CA). The results shown in Figure 16 indicate that 2CFE 3 binds 
and catalyzes the conVer'sion of D-gliic.osamine-6-phosphate ito D-glucosamine-l- 
phosphate. . " ' • ' ' 

i 

i 

5 . 

Computer-Aided Comparison Of CFE 86 , ' ' 

The comparison results, as described in Example 9 supra, suggested that the CFE 86 
polypeptide (SEQ ID NO: 195) is an acetyltransferase, such as GlmU which is a 

10 bifunctional enzyme in K coli. It has been previously shown that, in E coli, GlmU is a 
bifunctional protein having* both the acetyltransferase and uridylyltransferase active sites 
(Mengin-Lecreulx, D. and J. van Heijennort 1994 J. Bacteriol 176:5788-5795; Gehring, 
Al., et al., 1996 Biochemistry 35:579-585). The bifunctional enzyme catalyzes the 
conversion of D-glucosamine-l -phosphate to N-acetylglucosamine-1 -phosphate 

15 (acetyltransferase), and catalyzes the conversion of N-acetylglucosamine-1 -phosphate to 
UDP-N-acetylglucosmine-1 -phosphate (uridylyltransferase). The Km of the 
acetyltransferase and uridylyltransferase reactions has been previously calculated 
(Mengin-Lecreulx, D. and J. van Heijennort 1994 supra ). Additionally, the crystal 
structure of GlmU from E. coli is known (Brown, K., et al., 1999 EMBO J. 18:4096- 

20 4107). 

Purification of the CFE 86 Polypeptide 

The 2CFE 86 polypeptide (SEQ ID NO:409) has a C-terminal histidine tag. The 2CFE 
25 86 polypeptide was produced using the large scale IPTG-induced method described in 
Example 5, supra. The 2CFE 86 polypeptide was purified using the Ni-NTA affinity 
column method described in Example 6, supra. The eluted 2CFE 86 polypeptide was 
dialyzed against 50 mM Tris-Cl, 100 mM NaCl, 25% glycerol, pH 8.0. Samples of the 
purified 2CFE 86 polypeptide were electrophoresed on a polyacrylamide gel (Figure 17). 

30 
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t 

. Coupling CFE 3 and CFE 86 to Produce UDPAG 

A biochemical assay was performed, to determine whether 2CFE 3 and 2CFE 86 convert 
D-glucosamine-6-phosphate to UDP-N-acetylglucosamine-1 -phosphate (e.g., UPPAG). 
5 The 2CFE 3 and 2CFE 86 polypeptidies were used in a coupled reaction based on the 
assays described in Jolly, L. P., et al., 1999 Eur. J. Biochem. 262:202-210. 

A time-dependent and dose-dependent assay were performed. Briefly, the assay was i 
performed in 96- well plates, each well including 100 jjl volume. The assay included: 1 

10 mM D-glucosamine-6-phosphate (Sigma); 0.7 mM D-glucosamine-l,6-diphosphate 
(Sigma); 1.2 mM acetyl-Coenzyme A (Sigma); and 5 mM uridine-5 '-phosphate (Sigma); 
3 mM MgCl2 (Sigma); 50 mM Tris-Cl, pH 8.0 (Life Technologies). The reaction was 
started by adding 1 \ig of 2CFE 3;' and : 1 0 p,g of 2CFE 86. The reaction was performed at 
room temperature. The reaction was stopped at 0, 15, 30, and 65 minutes, by filtering out 

15 the'2CFE polypeptides. 

i 

The results of the assay was monitored by HPLC (high pressure liquid chromatography) 
using an Optisil IOjj, SAX column (250 x 4.6 mm), measuring at 262 nm, the mobile 
phase was 150 mM KH2PO4 (pH 3.5), and 1.5 ml/minute flow rate. The results shown in 
20 Figure 1 8 show the time-dependent assay and indicate that HPLC detected the presence 
of UDPAG. 

CFE 86: The Uridylyltransferase Region 

25 The 2CFE 86 polypeptide was tested in a uridylyltransferase reaction, in which N-acetyl- 
D-glucosamine-1 -phosphate and UTP produce UDP-N-acetylglucosamine. The 
uridylyltransferase reaction was monitored using a malachite green/inorganic 
pyrophosphatase assay (e.g., malachite green-IPPAse assay) and/or monitored using 
HPLC. The malachite green-IPPAse assay was used to measure orthophosphate 

30 production from digestion of the pyrophosphate liberated in the uridylyltransferase 
reaction. 
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The malachite green reagent was prepared as follows. A 0.045 % solution of malachite 
green (Sigma; M9636) was prepared in water.. A 4.2 % solution of ammonium 
molybdate (Mallinckrodt) was prepared in 4N HC1. The malachite green and ammonium 
5 molybdate were mixed in a 3:1 ratio, and stirred for about 20 minutes. The mixture was 
filtered, and stored at 4 degrees C. The inorganic pyrophosphatase (Sigma; 1-2267) was 
diluted to 0.1 U/jal in'50 mM Tris/3mM MgCl 2 ph 8.0, and stored at 4 ciegrees C. 

The uridylyltransferase reaction was performed in 96-well plates. The coupled reaction 

. . • i 

10 described herein was performed, in the presence of 2CFE 3 alone or 2CFE 3 and 2CFE 

86, and included the addition of 0.5 U/well of the diluted inorganic pyrophosphate. The 

reaction was mixed for 5 minutes at room temperature. The reaction was stopped by the 

addition of 240 jal/well of the malachite green reagent and 30 |il/well of 34% sodium 

* • ' • 

citrate, and the reaction was mixed. The results of the uridylyltransferase reaction was 

1 5 monitored by spectrophotometry at 660 nm. 

The results of separate uridylyltransferase reactions were monitored by HPLC, using a 
Phenosphere-NEXT CI 8 column (250 x 4.6 mm). The mobile phases included A and B 
as follows: A) methanol/10 mM potassium phosphate pH 6.5 (0:100); and B) 
20 methanol/10 mM potassium phosphate pH 6.5 (40:60). The mobile phases were ;run 

r 

under the following conditions: 100% mobile phase A for 5 minutes,* to 100% mobile 

to _ 

phase B in 3 minutes; and hold 1 00% mobile phase B for 9 minutes. The retention time 
for the UDPAG product is approximately 5.75 to 6.0 minutes. . . . 

25 The results three uridylyltransferase reactions, monitored by HPLC are summarized in 
Table in below. 
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TABLE ni 



i 



Purified CFE 86: 

2CFE 86-1 
2CFE86-2 
2CFE 86-3 



Specific Activity 
(nmol/min/ug): 

' 3.1 , 
3.4 
3.1 



The results of the uridylyltransferase reactions, monitored by HPLC or HPLC and 
Malachite Green IPPAse assays are summarized in Table IV below. 



i.' * 



10 



TABLE IV 



Reaction: 

Acetvltransferase reaction: 

Glucosamine- 1 -P 
Acetyl-coA 

Uridvlvtransferase reaction: 

N-acetylglucosamine- 1 -P 
UTP 



Km (uMY. 



Method: 



94 
150 



48 
79 



HPLC 
HPLC 



HPLC and MG/IPPAse 
HPLC 



15 



20 



EXAMPLE 14 



The following provides a description of the methods used to characterize various 2CFE 
polypeptides, including CFE 21, 34, 35, 39, and 90. The molecular weight of these 2CFE 
polypeptides were analyzed by size exclusion chromatography and gel electrophoresis. 
The 2CFE 34, 35, and 90 polypeptides putatively mediate fatty acid biosynthesis. 
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Computer-Aided Comparison 

■ 

* 

1 The computer-aided comparison, as described in Example 9 supra, suggests that CFfi 34 
(SEQ ID NO:143), CFE 35 (SEQ ID NO:144),' and 90 (SEQ ID NO:l'99) are 

5 polypeptides which mediate a fatty acid biosynthesis pathway (Figure 19) 

i i 

, • « ' 

The comparison suggests that CFE 34 is a malonyl CoArACP transcylase, which 
catalyzes the reaction in which malonyl CoA and acyl carrier protein (ACP) are 
converted to malonyl- ACP and CoA. Thus, the CFE 34 polypeptide may be a homologue 
10 ofE. coliFabD. 



i ■ • 



. The comparison suggests that CFE 90 is a 3-oxoacyl-ACP synthase II (beta ketoacyl- 

ACP synthase II) which catalyzes the reaction in which malonyl-ACP is converted to 

I 

beta aceto acetyl-ACP, Thus, the CFE 90 polypeptide may be a homologue of J?, coli 
15 FabF. 



The comparison suggests that CFE 35 is a 3-pxoacyl-ACP reductase (beta aceto acetyl 
ACP reductase) which catalyzes the reaction in which beta-keto-acetyl-ACP is converted 
to beta-hydroxy-acetyl-ACP. Thus, the CFE 35 polypeptide may be a homologue of E. 
20 co/ZFabG. 



Size Exclusion Chromatography 

* 

The estimated molecular' weights of 2CFE 34 (SEQ ID NO:361), 2CFE 35 (SEQ ID 
25 NO:362), and 2CFE 90 (SEQ ID NO:413) were determined using the Biosil SEC-125 
HPLC Gel Filtration column as described in Example 8, supra. 

The results suggest that 2CFE 34 polypeptide is a monomeric protein (33,093 Da), 2CFE 
35 is a trimeric protein (25,758 Da; approximately 85%), and 2CFE 90 is a dimeric 
30 protein (43,930 Da). Selected eluted samples of 2CFE 34 were electrophoresed on a 
polyacrylamide gel (Figure 20). 
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t 

11' 

Biochemical Assay: CFE 34 

The function of 2CFE 34 was determined by performing various biochemical reactions. 
5 To determine whether 2CFE 34 catalyzes the convertion of malonyl-CoA to malonyl and 
CoA, the following reaction was performed, 

' i 

The biochemical reaction was performed' in the presence of acyl carrier protein. The 
reaction included the following: 10 pM ,4 C labeled malonyl-CoA, 20 \xM ACP, 30 |jM 

10 2CFE 34 (e.g., FabD) in 20 mM Tris-Cl, pH 8.0 and 5 mM DTT in 300 jol volume.' The 
reaction was performed at room temperature (e.g., approximately 24 degrees C) for 30 
minutes. The reaction was terminated with the addition of 45 ul of 0.5% TFA. The 
labeled reaction was injected onto a MonoQ 5/5 column on a Gilson HPLC. Detection 
was performed by monitoring the radioactivity of the continuous flow-through of the 

15 HPLC effluent. Chromatography was performed using a buffer gradient for column 
elution. Buffer A included 20 mM Tris-Cl, pH 8.3. Buffer B was the same as Buffer A 

* 

and included 1 M NaCl. The program was held at 90% A, 10% B for 10 minutes 
followed by a linear ramp to a final mix of 50% of each Buffer A and B over 10 minutes. 

20 The substrate (e.g., 14 C malonyl-CoA) eluted at 9.9 minutes, the product (e.g., 14 C 
malonyl-ACP) eluted at 14.3 minutes. The results indicate that CFE 34 catalyzes the 
conversion of malonyl-CoA and acyl carrier protein (ACP) to malonyl-ACP and CoA. 

EXAMPLE 15 

25 

The following provides a description of the methods used to characterize CFE 
polypeptides 40, 41, and 46. 
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Computer-Aided Comparison 

i i • * 

« . ■ 

i 

The computer-aided comparison, as described in Example 9 supra^ suggests that the CFE 
40 polypeptide (SEQ ID NO: 149) is a phosphomethylpyrimidine (HMP-P) kinase 

i t - 

\ 

5 involved in thiamine biosynthesis. 

' I' . • ' I 

' ' I I ' 

The comparison, as described in Example 9 supra- suggests that the CFE 41 polypeptide 
(SEQ ID NO; 1 50) has a QTP -binding motif and may be a protease. 

10 The comparison, as described in Example 9 supra, suggests that the CFE 46 polypeptide 
(SEQ mNO:155) has an ATP-binding motif. ' ■ 

■ i ' i ■ • 

Affinity Purification of CFE 41 

i 

15 The large-scale method described in Example 5 supra (e.g., IPTG-induced protein 
production) was used to prepare a sample of 2CFE 41 polypeptide (SEQ ID NO:368). 
The sample was affinity purified using the Ni-NTA method described in Example 6, 
supra. The eluted fractions were loaded onto and run on a 12% SDS-PAGE gel (Novex) 
(Figure 21). 

20 

Circular Dichroism and Circular Dichroism Thermal Melt Analysis 

Circular dichroism and circular dichroism thermal melt methods were performed using 
JASCO instrumentation. The concentration of the isolated 2CFE 40 (SEQ ID NO:367) 
25 was approximately 21 jjM, in a 0.1 cm pathlength cell at 210 nm. The circular 
dichroism spectrum suggests that this preparation of 2CFE 40 had mixed alpha and beta 
secondary structure. The circular dichroism thermal melt spectrum suggests that 2CFE 
40 has a T m of approximately 67 degrees C. The 2CFE 40- polypeptide precipitates at 
approximately the T m . 
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i 

i . . . , 

The concentration of the isolated 2CFE 4l (SEQ ID NO:368) was approximately 70 jjM, 
i in a 0.02 cm pathlerigth cell. The circular dichroism spectrum suggests that ithii 
* preparation of 2CFE 41 had mixed alpha and beta secondary structure, with a greater 

j 

5 percentage of alpha structures. The circular dichroism thermal melt spectrum suggests 

that 2CFE 41 has a T m of approximately 38 degrees C. The 2CFE 4} polypeptide 

i 

precipitates at approximately the T m . 

1 . ' , ' r ' • 

The concentration of the isolated 2CFE 46 (SEQ ID NO:373) was approximately 23 ^M, 
10 . in a 0.1 cm pathlength cell at 280 nm.' The circular dichroism spectrum suggests. that this 

i 

preparation of 2CFE 46 had mixed klpha and beta secondary structure. The circular 
dichroism thermal melt spectrum suggests that 2CFE 46 is highly stable at elevated 

■ i 

temperatures. At 90 degrees C, the 2CFE 46 polypeptide exhibited only a 27% loss in 
signal and the polypeptide remained soluble. 

15 : . 

i 

Capillary Electrophoresis . 

Capillary electrophoresis was performed on samples of purified 2CFE 40, 41 and 46. 
The electropherograms of 2CFE 40, 41, and 46 are shown in Figure 22. 

20 

EXAMPLE 16 

The following provides a description of methods that can be used to characterize CEG 
polypeptides (e.g., GFE polypeptides). 

25 

Computer- Aided Compilation 

Computer-aided compilation of bacterial metabolic pathways may be analyzed using 
Pathway Tools from Doubletwist, based on the EcoCyc system (Karp P.D., et al. s 1999 
30 Nucleic Acids Res. 1999 27(l):55-58). This analysis may be used to predict which CFEs 
mediate various steps of the pathways. This information may be used in combination 
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* . I 

with the results of a binding reaction which identifies a ligand or substrate that binds with 

■ ' 

a CFE polypeptide. 

i . , * • i 

Identifying the Function of a CFE Polypeptide ' ' ,' 

5 

i 

The function of a CFE polypeptide may be identified by identifying a ligand or substrate 

which binds with the CFE polypeptide. The ligand or substrate may be identified using 

* • I ■ 1 . ' ' ' ' 

fractionation and affinity capillary electrophoresis methods. The following method is 

based upon the assumption that the bacterial cell lysate includes the ligand or substrate. 

io ■ i ■ " 1 1 ' 

i 

A bacterial host cells carrying an endogenous (e.g. native) CFE gene or carrying a 
recombinant vector which includes a CFE gene may be cultured so that the CFE 

i 

polypeptide is produced by the cell. The cells may be ruptured in order to obtain the cell 

i t 

lysate. The cell lysate may be fractionated using HPLC technology. The HPLC fractions 

i 1 

15 may be reacted with a CFE polypeptide in a binding reaction, and the binding reaction 
may be analyzed by affinity capillary electrophoresis methods. The ligand or substrate 
which reacts with the CFE polypeptide may be identified using mass spectrophotometry 
methods (in "Mass Spectrometry" 1990 eds. McCloskey, J. A., in Methods in 
Enzymology volume 193; Henion, J., et aL, 1993 "Mass Spectrometric Investigations of 

20 Drug-Receptor Interactions" Ther. Drug Monit. 15:563-569; Loo, J. A., et al., 1999 
"Application of Mass Spectrometry for Target Identification and Characterization" Med. 
Res, Rev. 19:307-319; Nguyen, D. N., et al., 1995 "Protein Mass Spectometry: 
Applications to Analytical Biotechnology J. Chromatogr. 705:21-45). 

25 EXAMPLE 17 

The following provides a description of nuclear magnetic resonance (NMR) spectroscopy 
methods that were used to characterize CFE polypeptides. 

30 High resolution NMR spectroscopy was applied to I5 N-labled, !3 C/ ! ^-labeled, 
H/ CI N-labeled, and type-specifically isotopically labeled CFE polypeptide samples 
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in the solution state for the following purposes: to assess various aspects of the structural 
state, e.g., foldedness, structural integrity; to refine a previously determined experimental 
structure of a close sequence homologue; to refine a homology7mddeled structure; to 
assess the potential for a CFE polypeptide , to bind small molecules; and to identify small- 
molecule pharmacophdric fragments that bind specifically to the CFE polypeptide 
(VNucIear Magnetic Resonance" 1994 eds. James, T. LJ in Methods in Enzymologj? 
volume 239). , ' 



• i 



The NMR analysis includes screening both a compound deck, of approximately 4,500 
commercially available, structurally and chemically diverse compounds (the small- 
molecule pharmacophore deck) and a compound deck of proprietary, known, anti- 
microbial compounds (anti-microbial deck) against the CFE polypeptides (i.e., target 
polypeptides) to determine, either based upon perturbations to the chemical shifts of the 
amide proton iand/or nitrogen resonances, as measured from a two-dimensional proton- 
nitrogen heteronuclear single-quantum correlation spectrum (2D screening method), or 
based upon increases in the linewidth of the compound's proton resonance(s), as 
measured by a one-dimensional Ti p spin-lock difference spectrum (ID screening 
method), both whether a compound binds to a CFE polypeptide and, in the case of the 2D 
screening method, where the compound binds on the CFE polypeptide. 

Isotopic Labeling of CFE Polypeptides 

BL21-DE3 E. coli bacteria are transformed with the CFE expression vectors. Expression 
takes place between 20°C and 37°C in minimal media containing [ 15 N]- ammonium 
sulfate as the sole nitrogen source and either glucose, [ 2 H]i3-glucose, or [ 13 C]6-glucose as 
the sole carbon source. Glucose is used for preparing uniformly 1 ^-labeled and 2 H/ 15 N- 
labeled CFE polypeptides. [ H]i 3 -glucose is used for preparing type-specifically 'H/^C- 
labeled, uniformly I5 N-labeled CFE polypeptides. [ 13 C] 6 -glucose is used for preparing 
13 C/ 1 ^-labeled CFE polypeptides. The minimal media is prepared in 100% H2O for 
expressing both uniformly 15 N-Iabeled and uniformly 13 C/ 1 ^-labeled CFE polypeptides; 
the minimal media is prepared in 95% D 2 0 (deuterium oxide) and 5% H 2 0 for expressing 
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both type-specifically ^^C-labeled, uniformly 15 N-labeled. and just uniformly 2 H/ 15 N- 
labeled CFE polypeptides. In the case of type-specifically 1 H/ 13 C-labeled, uniformly 15 N- 
1 labeled CFE polypeptides, 40 mg/L of protonated and uniformly ^I^^N-lateled' 
isoleucine, valine and leucine amino acids are added to the minimal media. ' * 

NMR Screening • ■ , 1 

- ■•■ . ■ . 1 , ■ ' 

Compounds in the anti-microbial deck are pre-dissolved to a target concentration of 16 

i * » 

m!V[ in deuterated DMSO (dimethylsulfoxide) with each deck well containing only one 

1 'if'. 
10 compound. Compounds in the small-molecule, pharmacophore deck are pre-dissolved in 

deuterated dmso to a target concentration of 50 mM in groups of 8, i.e., each deck well 

contains 8 unique compounds with each compound at a target concentration of 50 mM. 

* h 

j 

3.5 \il of compound is placed at the bottom of a well in .a 96-well, screening plate. This 
1 5 well will be referred to as the compound screening well. Each compound screening well 
contains solution from only one deck well. 166.5 pi of buffer is added to each compound 
screening well. 170 pi of a CFE polypeptide solution, initially at a concentration ranging 
from 200-300 pM, is added to each compound screening well; the contents of that well 
are then thoroughly mixed. The control screening well contains only 3.5 pi of deuterated 
20 dmso. The screening plate is then centrifuged in a bucket rotor for 15 minutes at 3,500 
rpm to insure that all particulate matter is at the bottom of the well. 

The 2D screening method requires a single control screening well in which the compound 
solution consists only of deuterated DMSO. The ID screening method requires a control 
25 screening well for each compound screening well. In the case of the ID screening 
method, the control screening well is prepared identically to the compound screening 
well except that the 170 pi of a CFE polypeptide solution is replaced by 170 pi of buffer. 

The screening plate is covered with aluminum foil and placed onto a rack of a Gilson 
30 liquid handler. The Gilson liquid handler, under computer control by the NMR host/data- 
acquisition software, is responsible for removing each sample from the screening plate, 
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injecting the sample into a high-resolution, ! H/ 15 N double-resonance NMR flow-prbbe, 
removing the sample from the flow-probe, and dispensing it back into the screening plate 
well from which the sample wa? originally removed. NMR dat^ are collected on the 
sample while the 1 sample resides in the, NMR flow-probe. The type' of NMR data 
5 collected depends upon whether the 2D or ID screening method is being used. 

i . . 

i < ' ' ■ 

Determining Structural Characteristics of a CFE Polypeptide ' 

In assessing various aspects of the structural state of a CpE polypeptide, NMR was used 
10 to provide the following information. The proton ID spectra and proton-nitrogen 2D 

r 

correlation NMR spectra were used to assess the overall foldedness of a CFE polypeptide 

1 i 
without actually describing in detail that folded state. Unfolded and substantially 

misfolded proteins produced distinct signatures in these two types of NMR spectra. 

i 

15 The chemical shift of most protein nuclei in either the set {Hn, H a , Hp, C\ Qx> Cp, N} or 
the set {Hn, C\ C a , Cp, N} for perdeuterated (e.g., 2 H-labeled) proteins were determined 
by procedures well known in the art that involve collecting up to 10 triple-resonance 
NMR data sets. The protein secondary structure was delineated as either helical, turn or 
extended (e.g., p -sheet) by measuring A(8ca - Sep), A5C\ and Abu* where 8 refers to the 

20 chemical-shift value and A refers to the difference between chemical-shift values 

measured in this protein and those measured for the same residue type in a random-coil 

(unstructured), tetrameric peptide. 

* 

This secondary-structure profile was generated in approximately 2-3 weeks per protein. 
25 The secondary-structure profile was used to confirm the functional identity of a protein. 
It was also used to refine the list of possible functional identities of folds, predicted by 
various computational techniques including fold recognition which is associated with a 
protein or polypeptide. 
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NMR was used to generate folds of proteins or polypeptides for which both no structure 

was known of a sequence homologue and no structural homologue was discernible in the 

i i ( ... 1 i , . i .... 

PDB by fold recognition techniques. ' ■ , " . ■ 

1 i ■ 

■ ■ , ■ i .1 

i 

5 Refining a Structural Model 

Nuclear Overhauser (NOE) data were used to refine both horriology : modeled structured 
and previously determined experimental structures of close sequence homologues. This 
process took approximately 2-3 weeks per structure. 
10 ' , 

The CFE 88 polypeptide was characterized by NMR analysis, to establish its secondary 
structure. The NMR data was used to filter the computer-aided threading analysis. The 
NMR-determined secondary structure for CFE 88 suggested that CFE 88 is structurally 
similar to 4-aminoimidazole carboxylase. 

15 

i 

The characteristics of other CFE polypeptides were analyzed by NMR methods. A 
computer-aided threading analysis revealed that the N-terminal domain of the protein 
EGA, which both binds and hydrolyzes GTP, was both structurally similar and 
sufficiently similar in sequence to CFE 52 to suggest that CFE 52 had a similar function. 

20 

The NMR data of CFE 103 suggests that this polypeptide is unfolded. Circular dichroism 
spectra, as a function of temperature, also indicated that CFE103 was unfolded. 

The CFEs 2, 42, 43, 68 and 88 polypeptides were tested for their ability to bind potential 
25 inhibitor molecules by screening both the anti-microbial deck and the small-molecule, 
pharmacophore deck. CFE 34 was tested for its ability to bind potential inhibitor 
molecules by screening the anti-microbial deck. 
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i i 

Characterizing Small-Molecule Binding 

■ ' 

i NMR-based screening was used to measure binding against both the small-mpiecule, 1 
pharmacophore deck and the anti-microbial deck. ' Binding data from 1 these screens 
5 allowed assessment of the propensity of a protein to bind small molecules. The binding 

i 

data was also used to identify sites on the protein which are capable of binding small 

molecules. The binding data was also used to identify common pharmacophores among 

1 . i 

the compounds which bind. 

10 Reverse screening refers to a process whereby known anti-microbial compbiinds, the 
microbial target of which is unknown, are screened by a general method, e.g., binding as 
assessed by NMR, to find a physical interaction with polypeptide targets previously 

t : 

determined to be essential to the bacteria (i.e., the CFEs). The reverse screening method 
was used to determine which CFE polypeptides bind to which compounds in the anil- 

15 microbial deck. The reverse screening method included the following. The compounds 
in a proprietary compound deck were screened for Minimal Inhibitory Concentration 
(e.g., MIC). The compounds exhibiting antimicrobial activity were designated active 
compounds. The CFE polypeptides were screened to determine which polypeptide bind 
to which active compounds. The CFE polypeptides which bound to the active 

20 compound(s) were confirmed, where possible, i.e., in cases where an in-vitro assay was 
possible to construct, as being inhibited in their function as a polypeptide by the active 
compound(s) by examination of the inhibition profile of the compound(s) against the 
CFE polypeptides. For additional confirmation, the effect of the compound on the 
microorganism harboring the CFE polypeptide was monitored (e.g., whole cell assays). 

25 The structure of the active compound was used as a basis to generate chemically-related 
compounds by iterative synthesis. The chemically-related compounds were tested in a 
screening assay for binding with CFE polypeptides. The active compounds and the 
chemically-related compounds of interest were the compounds which exhibited an 
increase in bindinjg affinity for a CFE polypeptide and/or exhibited drug-like properties. 

30 
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t 

i 

The results of the reverse screening, are as follows. 127 compounds from the proprietary 
compound deck exhibited anti-microbial activity. 94 of these active compounds were 

1 selected based upon both lack of cytotoxicity and, lack of excessive hydrophobidity. 

i i ' 

These 94 compounds were soluble to 16 mM in deuterated- DMSO; these compounds 
5 were also deemed to be sufficiently soluble in aqueous buffer for both the 2D and ID 

r . 

NMR screening methods. ■ ' 

This subset of 94 compounds was used in an NMR-based screen to 1 determine which 
compound binds to which CFE polypeptide. The CFE 42 polypeptide bound two 
10 different compounds with Kd's in the; range of 0.2 to 1 mM; the CFE 43 polypeptide 
bound one compound with Kd ~ 30-50 pM; the CFE 34 polypeptide bound 13 
compounds, one of which inhibited the polypeptide function with IC50 < 10 |LtM. 

i 

1 , 

The enzyme assay used to confirm the 1 NMR results which suggested CFE 34 interaction 
15 with the compounds included the following: 10 \xM 14 C-labeled malonyl CoA; 20 \xM 
ACP, 30 pM CFE 34; 20 mM Tris-Cl, pH 8.0; 5 mM DTT; in the presence of absence of 
50 jjM of a compound solubilized at 40 mM in 100% DMSO and dilute 100-fold into 
10% DMSO and further diluted 8-fold for a final concentration of 50 \xM in 1.25% 
DMSO. The reaction was performed at room temperature, the reaction was stopped with 
20 the addition of TFA. Two hundred jal of the reaction was injected onto a Mono Q 5/5 
column. The chromatography conditions included: A) 20 mM Tris-Cl, pH 8.3; B) 20 
mM Tris-Cl, pH 8.3, 1 M NaCl. Hold 10% B for 5 minutes, linear gradient from 10% B 
to 50%B in 10 minutes, back to 10% B in 1 minute, hold for 14 minutes to re-equilibrate. 
The reaction substrate ( 14 C- malonyl CoA) eluted at 9.9 minutes, the reaction product 
25 ( 14 C-malonyl ACP) eluted at 14.3 minutes. 
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What is claimed is: » < 

(■ I 1 ' 

1 , ■ 

I 

• ( 4 

.1. An isolated' nucleic acid molecule encoding a polypeptide which is (1), essential 

i ' t • * i 1 

for the viability of a bacterial cell and (2) has at least any one of the functions of a 
pantothenate kinase, a Holliday Junction branch migration protein, a single 

stranded DNA binding protein, a phosphoglucosamine mutase, an 

i - * ' i( • 

acetyltransf erase, an uridylyltransferase, a malonyl Coenzyme A :ACP transcylase, 
, a 3-pxoacyl-ACP synthase 1 1 II, a * 3-oxoacyl-ACP reductase, a 
phoisphomethyipyrimidine (HMP-P) kinase, a GTP binding protein, a ATP 
binding protein, or a 4-aminoimidazole carboxylase. 

" . '. • ■ -■ 

2. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:97 or Figure 115 and wherein the polypeptide is a 

i 

pantothenate kinase. 

* 

i 

• i 

3 • The isolated nucleic acid molecule of claim 1 , wherein the nucleic acid molecule 
is shown in SEQ ID NO:35, Figure 60, SEQ ED NO:19, or Figure 44,and wherein 
the polypeptide is a Holliday Junction branch migration protein. 

4. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule, 
is shown in SEQ ID NO: 8 or Figure 33 and wherein the polypeptide is a single 
stranded DNA binding protein. 

5. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:3 or Figure 28 and wherein the polypeptide is a 

■ * 

phosphoglucosamine mutase. 

6. The isolated nucleic acid molecule of claim 1 , wherein the nucleic acid molecule 
is shown in SEQ ID NO: 82 or Figure 103 arid wherein the polypeptide is a 
acetyltransferase. 
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7. The isolated nucleic acid molecule, of claim 1, wherein the nucleic acid molecule 

ii' ' • 

is shown in SEQ ID NO: 82 or Figure 103 and wherein the polypeptide is a 
uridylyltransferase. ■ , , 1 ' * . 



20 



8. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:30 or Figure 5f> and wherein the polypeptide is a 
malonyl Coenzyme A: ACP transcylase. . 



9. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
10 is shown in SEQ ID NO:86 or Figure 107 and wherein the polypeptide is a 3- 

oxoacyl-ACP synthase n. . , 



10. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 

■ i 

is shown in SEQ ID NO:31 or Figure 56 and wherein the polypeptide is a 3- 
15 oxoacyl-ACP reductase. 



11. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:36 or Figure 61 and wherein the polypeptide is a 
phosphomethylpyrimidine (HMP-P) kinase. 

12. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:37, Figure 62, SEQ ID NO:48, or Figure 73, and 
wherein the polypeptide is a GTP binding protein. 



25 13. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 

is shown in SEQ ID NO:42 or Figure 67 and wherein the polypeptide is a ATP 
binding protein. 
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14. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO: 84 or Figure 105 and wherein the polypeptide is a 4- 
aminoimidazole carboxylase. . 1 



15. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid molecule 
is shown in SEQ ID NO:48 or Figure 73 and wherein 1 the polypeptide is a GTP 
1 ( binding protein. 



16. An isolated nucleic acid molecule encoding a polypeptide which is essential for 
the viability of a bacterial cell- the nucleic acid molecule comprising a sequence 
shown in any one of SEQ ID NOS:l-l 13. 



17. An isolated nucleic acid molecule encoding a polypeptide which is essential for 
the viability of a bacterial cell, 1 the nucleic acid molecule comprising a sequence 
shown iri any one of Figures 26- 130. 



18. An isolated nucleic acid molecule encoding any one of a polypeptide designated 
CFE 1-117 having the amino acid sequence shown in SEQ ID NO: 1 14-226. 



19. An isolated nucleic acid molecule comprising a nucleotide sequence which is 
complementary to the nucleotide sequence of claim 1, 16, 17 or 18. 



20. The isolated nucleic acid molecule of claim 1, 16, 17 or 18 which is DNA or 
RNA. 



21. The isolated nucleic acid molecule of claim 20, which is labeled with a detectable 
marker. 



22. The isolated nucleic acid molecule of claim 21, wherein the detectable marker is 
selected from the group consisting of a radioisotope, a fluorescent compound, a 
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bioluminescent compound, a chemiluminescent compound, a metal' chelator and 
an enzyme. 

i . ' . • , . 

• ' i 

23. A vector comprising the nucleotide sequence of claim 1, 16, 17, or 18. 

24. A host-vector system comprising thevector of claim 23, in a suitable host cell. 

* 1 

:■ 1 ' ■. • •■ 

25. The host- vector system of claim 24, wherein the suitable host cell is selected from 
a group consisting of a yeast cell, a plant cell, and an animal cell. 

26. The host- vector system of claim 24, wherein the suitable host cell is selected from 
a group consisting of an Escherichia cell, a Bacillus cell, a Pseudomonas cell, a 
Streptococcus cell, and a Streptomyces cell. 

27. An isolated polypeptide which is essential for the viability of a bacterial cell 
comprising the amino acid sequence as shown in any one of SEQ. ID NOS: 114- 
226. 

i 

28. An isolated polypeptide which is essential for the viability of a bacterial cell 
encoded by the isolated nucleic acid molecule of claim 1, 16, 17, or 18. 

29. The isolated polypeptide of claim 27 or 28 which is a fusion polypeptide. 

30. A method for producing a polypeptide having the. amino acid sequence of any one 
of SEQ ID NOS: 114-226 or a polypeptide encoded by the polynucleotide 
sequence as shown in any one of Figures 26-130, comprising: 

a) culturing the host-vector system of claim 24 under suitable conditions so as to 
produce the polypeptide; and 

b) recovering the polypeptide so produced. 
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3 1 . A polypeptide produced by the method of claim 3 6. 

1 32. A ligand whichi binds the polypeptide of claim 27 or 28. ' ,' ' ' ' 1 1 

. ' i ' 

5 33. The ligand of claim 32 which is an antibody or an immunologically active 

fragment thereof. ( 1 ■ 

34. The ligand of claim 33, wherein the antibody is a monoclonal antibody. 

* * ' * 

i * 1 

10 35. The ligand of claim 32 which is a diazalactone. 

! 

! i 

36. The ligand of claim 35, wherein the diazalactone comprises the structure: 




37. The ligand of claim 32 which is a iV-pr.otected amino acid. 

38. The ligand of claim 37, wherein the iV-protected amino acid comprises the 
structure: 




O- 



39. The ligand of claim 32 which is an azabicyclodiene. 

20 
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40. The ligand of claim 39, wherein the azabicyclodiene comprises the structure: 




.5 41. The ligand of claim 32 which is an alkaloid. 



( 

42. The ligand of claim 41, wherein the alkaloid comprises the structure: 

j 




10 

43. The ligand of claim 41, wherein the alkaloid comprises the structure: 
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44. The ligand of claim 41, wherein the alkaloid comprises the structure: 




CI 



i 

45. The ligand of claim 41, wherein the alkaloid comprises the structure: 
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46. The ligand of claim 41, wherein the alkaloid composes the structure: 




47. A method for detecting the presence of the polypeptide of claim 27 or 28 in a 
sample, comprising contacting the sample with a ligand which binds the 
polypeptide and detecting the binding of the polypeptide with the ligand in the 
sample. 



10 



48. The method of claim 47, wherein the detecting comprises: 

a) contacting the sample with the ligand; and 

b) determining whether a polypeptide-ligand complex is so formed. 



15 



49. The method of claim 47, wherein the sample is a cell, a tissue, or a biological 



fluid. 



50. The method of claim 47, wherein the sample is blood, serum, a swab from nose, a 

* 

swab from ear, or a swab from throat. 



20 



51. The method of claim 47, wherein the ligand is a diazalactone. 
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52. The method of claim 51, wherein the diazalactone comprises the structure: 



10 



15 




NO2- 



1 

53. The method of claim 47, wherein the ligand is a JV-protected amino acid. 



i.' 1 



54. The method of claim 53, wherein the //-protected amino acid comprises the 



structure: 




55. The method of claim 47, wherein the ligand is an azabicyclodiene. 



56. The method of claim 55, wherein the azabicyclodiene comprises the structure: 




57. The ligand of claim 47 which is an alkaloid. 
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58. The ligand of claim 57, wherein the alkaloid comprises the structure: 



i 

i 




59. The ligand of claim 57, wherein the alkaloid comprises the structure: 




60. The ligand of claim 57, wherein the alkaloid comprises the structure: 



10 
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61. The ligand of claim 57, wherein the alkaloid comprises the structure: 

■ i * i • ' 

i . 

' i • 

' 1 ■ i . ' . . . ■ 




i 



62. The ligand of claim 57, wherein the alkaloid comprises the structure: 

* * 

i 




63. A method for detecting the presence of a target nucleic acid molecule as shown in 
any one of SEQ ID NOS:l-113 in a sample, comprising contacting the sample 
with the complementary nucleic acid molecule of claim 19 and detecting the 
binding of the target nucleic acid molecule with the complementary nucleic acid 
molecule in the sample. 
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64. The method of claim 63, wherein the detecting comprises: 

a) contacting the sample with the complementary nucleic acid molecule; and 

b) determining whether a complex comprising the target nucleic acid molecule 
and t the complementary nucleic acid molecule is so formed. 



65. The method of claim 63, wherein the sample is. a cell, a tissue, or a biological, 
fluid. I 

i ■ i ..■ • 

10 ■ 66. The method of claim 63, wherein the sample is blood, serum, a swab from nose, a 

swab from ear, or a swab from throat. 



o7. a pnarmaceuticai composition cpmprising the nucleic acid molecule of claim 1, 
16, 17, or 18. ! 



68. A pharmaceutical composition comprising the polypeptide of claim 27 or 28. 



69. A pharmaceutical composition comprising the ligand^of claim 32. 

20 70. A method for determining whether a genomic nucleotide sequence of interest is 

essential for viability of a bacterial cell, comprising 

a. integrating an exogenous nucleotide sequence into the genomic nucleotide 
sequence of interest, wherein the exogenous nucleotide sequence 
comprises a portion of an open reading frame of the genomic nucleotide 

4 

25 sequence of interest, and 

b. determining whether the cell having the genomic nucleotide sequence of 
interest so integrated is viable. 



71. The method of claim 70, wherein the portion of the open reading frame comprises 
30 about 200 to 500 base pairs in length. 
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72. The method of claim 70, wherein the exogenous nucleotide sequence further 

comprises a nucleotide sequence conferring a, selectable' phenotype to the cell 

i • ' » ' i 

having the genome so integrated. 1 < ' 1 

i 

73. The method of claim 70, wherein determining comprises selecting the cell having 

the genome so integrated in the presence of a selection agent. ' 

'■■ ' 1 • - ' 

i ■ 1 ,. 

i 

74. The method of claim 73, wherein the selection agent is chloramphenicol. 

* 

• ■ 

75. A nucleotide sequence of interest which is essential for viability of a bacterial cell 
isolated by the method of claim 70. 1 

i 

76. A bacterial cell comprising an exogenous nucleotide sequence integrated into the 
genomic nucleotide sequence of interest, generated by the method of claim 70. 

* 

77. A method for determining whether a genomic nucleotide sequence of interest 
resides within an operon, comprising 

a) integrating an exogenous nucleotide sequence into the genomic nucleotide 
sequence of interest; and 

b) determining whether the cell having the genomic nucleotide sequence of 

■ 

interest so integrated is viable, and wherein the exogenous nucleotide 
sequence lacks an expression regulatory sequence. 

78. The method , of claim 77, wherein the exogenous nucleotide sequence further 
comprises a nucleotide sequence conferring a selectable phenotype to the cell 
having the genome so integrated. 

79. The method of claim 77, wherein determining comprises selecting the cell having 
the genome so integrated in the presence of a selection agent. 
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80. The method of claim 79, wherein the selection agen]t is chloramphenicol. 

■ ■■■ ,;. ■• 

1 i ' ' i , 1 

i , ' , .i 

) ■ ■ 

81. A method for inhibiting a function of a CEG polypeptide which is essential for 

5 viability of a bacterial cell, the method comprising contacting the CEG polypeptide 

• t i ' - • 

with the ligand of claim 32 under suitable conditions thereby inhibiting the function 

of the CEG polypeptide. ■ , , 

I I < , , 

I |- 

82. The method of claim 81, wherein the function of the CEG polypeptide is selected 
10 from a group consisting of a pantothenate kinase, a Holliday Junction branch 

migration protein, a single stranded DNA binding protein, a phosphoglucosamine 
mutase, an acetyltransferase, an uridylyltransferase, a malonyl CoenzymeArACP 
transcylase, a 3-oxoacyl-ACP synthase H, a 3-oxoacyl-ACP reductase, a 
phosphomethylpyrimidine (HMP-P) kinase, a GTP binding protein, a ATP 
15 binding protein, or a 4-aminoimidazole carboxylase. 

i 

i + 

83. The method of claim 81, wherein the CEG polypeptide is selected from a group 
consisting of CFE1 -113. 

20 84. The method of claim 81, wherein the CEG polypeptide is 2CFE 34 shown in 

Figure 55. 

85. The method of claim 81, wherein the CEG polypeptide is 2CFE 43 shown in 
Figure 64. 

25 
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86. The method of claim 81, wherein the CEG polypeptide is 2CFE 34 shown in 
Figure 55 and the ligand is: 




i 

5 87. The method of claim 81, wlierein the CEG polypeptide is 2CFE 43 shown in 

Figure 64 and the ligand is: 




NO2- 

1 

88. The method of claim 81, wherein the CEG polypeptide is 2CFE 43 shown in 
10 Figure 64 and the ligand is: 




O- 



15 
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89. A method for identifying a ligand in a sample which specifically binds a CEG 
polypeptide, the method comprising: 

a) contacting the CEG polypeptide with the sample under suitable, conditions 
so that a complex having the CEG polypeptide and the ligand is formed; 

b) recovering the complex so formed ; and 

c) separating the CEG polypeptide from the ligand in the complex and 



identifying the ligand so 



separated. 



90. The method of claim 89, wherein the sample is a tissue or biological fluid. 



I.* < 



91. The method of claim 89, wherein the ligand is an azabicyclodiene. 



92. The method of claim 91, wherein the azabicyclodiene comprises the structure: 




OH 



93. The method of claim 89, wherein the ligand is a diazalactone. 



94. The method of claim 93, wherein the diazalactone comprises the structure: 




NO2- 



95. The method of claim 89, wherein the ligand is a TV-protected amino acid. 
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96. The method of claim 95, wherein the iV»prdtected amino acid comprises the 



i i 



i i 



structure: 



i ' 




97. The method of claim 89, wherein the ligand is an alkoloid. 



98. The ligahd of claim 97, wherein the alkaloid comprises the structure: 




10 



99. The ligand of claim 97, wherein the alkaloid comprises the structure: 




15 
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100. 



The ligand of claim 97, wherein the alkaloid comprises the structure: 



' i 




101. 



The ligand of claim 97, wherein the alkaloid comprises the structure: 
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The ligand of claim 97, wherein the alkaloid comprises the structure: 




: i 
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fx • . 



^cygjh'homologue of SEQ«* ID jNO. 1 ": i • ' .; 

AT SATTtrATOCAGGAArTCTTGCC 
TJT TAG, LOCTACK/TCATCGACCTAtT^ 

AA \ ATT STAGaTTGGTGTTCAf GGAGACTCGGTTTC TCATGCAGA AGATCTTGTAGATAAATATCTTCCTCT 

Tti iTAA 5G AACGTATC ATC ATTACAAAGGGTCKjTG 

GO ZATt 3 ATGCTTATCGTCCGCTTACTCCAG AGGATATCGTTGTTACC^ 

AC ^CTT :GCATGATI€AGGACAATATCCAACrTGCCCAAAATC^ 

GC 3GTT SATACTATCGTTGAAAGTACCAATGGTCAATTTATTACAGATArre 

TP tAGG \CAAACACCTCAAACATTCCGrTOC/U^ 

<3k 3AAC GAAATCTTOACAGATGCATGTAAAATCTTTGTGAT^ 

Gp raAArACTCAAATCTGAAGATTACAACCGTAACAGATTTGA^ 

QArrAG 



2CF» homologue of SEQ ID NO. .2" 
AT< 3GCTMCGTAATTATTGAAA AAGcTaXa AGAGAGAATGACCCATCTCACCAATCACtTGGTCGTGAAT 
TT( iGTq< iTATCCGTGCTCGTCGTGCCAATGCA AGGTTGCTrGaCCGTGTACATGTAGAATACTATGGAGix: 
GA VACT :CTCTtAACCAAATCGCTTCAATTACGATrc 
CL tGTQ' TCATTGAAAGACATCGACCGIXiCCTTGAACGCTTGTGATC^ 
GfeTTCTCrTGATTCGCTTCGTrATCX^ 

' AQXaGG rCGK^GAAAATGCTAAAGTGGCTOTCCGCAATATCCGTC 
AACaKGAAAAAGCAAAAGAAATCACTOA^ 

acagaq jATGCTGTTAAACACATCGACGAGaTCACTGCTAACAAAGAGAAAG . 

'£Cf];3 ."Horriolo'gue-* *?f SEQ.- ID- NO. 3"' 
ATC rGG^^ATATTTTGGGACTGATGGAGTCCGTGGAGAAGCTA 

TT/ aActaggacgttttggaggctatgtt^ 

\ ACC rTpAtpACACGTATTTGAGGGGAAATOGTO . 

caogtatacaaacttggtgtcctt^ 
. ccc fgtgtcatgattlxn'ggtagccacaaoccagccc 
cttcaaacragatgatgaaaaagaagcagaa 
. tgg f cca agtgcagaaggcttacwaatm 
oo ctt sirrf -aactggaactcctcttcatggaatgaaggttgccttgg 

. CAC T6CC CGTQAAATCTTTGOAGACCTTG 
■ AAC AfC/ ACCTTAATdirGGTTCAACACATCCAGAAGCCCTTCAAG 
rp> , GCI ATTG GTTTGGCCTTTCATGGAGACAGTGACCGCTTGAT^ 
^ ATC GTG> CAAQATTATGTACATCATCCK}AAAATACCT^ 
TGT 3 ACA ACTCTTATGTCr AaC^ 

. . Aqi QCAC rTGGTGACCGCTACGfTGTTGAAG AAATCAGAAAATCAGGTTACAACCTTGGTGGTOAACAGT 
CTu 3TCA CGTTATCTTGATOGATTACAATACCACAGGT6A 

A/\|T CATG AAGGAAACTGGTA AGAGCTTATeAGAGTTGfGCGGCAGA AGTAACGAXTTATCCACAAA AATT 

AGT TAAt atccgagtggaAaacgtcatgaaggaaa AGGCCATGGAAGTGCGAGCT ATCAAGGCCATCAT 

CqA OA AC ATGGAAGAAGA AATGGCGGGGAACQGGCGTATCCTraiTC 

CTt JCGT 3TTATGGC AG A AGCOCCTACAACAGAAGAAGTAG ACTACTATGTTOATaCCATCACAGaTGTA 
GTTOGTG ZTGAaATTGGGATTGACTAA . 



2tF; 



yjl "bopio^ogue of SEQ. ID NGf* 4" 
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AiTGAAkAAAATACTAATTGTAGATGATG^ 
A, ^GT rATGi^GTTGTAACTGCTTTTAATC 

TV TTA{' TATTCTGGATTTGATGCTTCCAGAAATTGATGGTTTAGAAGT^ 

Q( :A«bT( iTGCCCATTCrrATGCTTTCAGCCA AAGAT AGTGAAmGATAAGGtTATCGGTTTGGAACTTGGQ 
G( : AGA rGACf ATGTA aCGAA ACCCTTCTCCAATC^ 

6' TCTC AACCTATGCCAGTAGATGGTCAGGAAGCAGATAGTAAACCTGAACGTATCCAAA^ 
v-^w AGaAA rTGTTCCAGACGCCTACGTGGC^ 

H* lGCTT TTGllATCATITAGCACCGCATACAGGtCAAGTCATCACGCGC 
GOGGTT ATG^TATTTTGGfGATC 

AC IATA< IGCCGIaGCCGaCCaGAGT ATATCTTG ACGCGCCGTGGTGTAGGGTATTACATGAGAA ATA ATGCi; 

tca ;][...',•. r," . . • ■ * ' . 

* 1 j : 

. 2 CIES' "homologue'.of SEQ. ID NO,! 5" 
41 GGaAgaAATTCTCTC^ 
CC CCCCkcTTfiiGCACTTGAA AAAGGT^ 

CC ACTACAATQA AATCAC AGATGTCCAGTTGACGGACGATGATTTC , 
CA CAGTCATGCTTTAGTGGTC 

AC GTTT CGTClTCGGdCAATGATGTCCTCTTGGTAGGAAATA AA AAAGATATCCTTGCTA AGTCAGTTAAO 
TIC TGGT M,GA^AGCCAGTGGCTCATGAAACGTX3CCCATGAAGAAGGTCTTC 

TA ACTT SaGCXGAAAATAAaTATGC^ATTAAGGAAGTCATTG^ . 
GC GATp rcrAnPGTCGTCGGTGTGACCAACGTTGGAAAATCAACTCTAATiCAATGCT 
^ CA CGGC TGATCAGAATGTCAIxrACTACrit^CCjCTl lCCA<^GACAACCTTGGACXAAATAGAGATrCCO 
1 5^ CrrGA<P 3ACG£ATGTTATATTTACGATACGCCGGGAATTATCCA^ 

H> CG GGCA AAAACCTCAAGTATGTCAGTiGCTAAAAAGG A AATCAAGCCTAAGACCT ATCAGCTTA ATCCTGA 
GC *AAC CCrATmTAGGTGGTTTGGGACGCTTTGACTTTAtAGCAG^ 

Tr rrjrt rATAATGA actca A ACTCCATCOTAGCA AGCTTGAAGGAGCTAGTGCTTTCTACG ATAAGCACC 
. • TG 3GAApxrrfrCTGACACCAC^ 

TX X^\TTAAAGAtAAGACAGACCTAGTCATTT r 
GjT SGCCpTGTGGGCACCAGAAGGCGTCtt 

fltCjr S6 "homologue 6f SEQ. ID NO. 6" - 
AT< 3TATCCAGATG ATAGTTTGACATTGCAC ACGG ACTTGTACCAGATCAAGATGATGCAGGTTTACTTTO 
ACi :AA<P SGATTCACAATi^AGAAGGCGGTCTITGAGGTGTATTTCCGGCAACAG 
TQCGQT TITGCAGGTTTGGA^\GAATTGTGAACTATCTTGAAG 
CC 'ATT] <KjAGTCGCTTGGTTATCATGGGGCGTTCTTGGATTAC 
Qt CGT, GT GGCCAAGAAGGGPATTTGGTTTITGCTAATGAACCGATXGTC 
G<?( :AAK iTCAGTroGTCGAAAGGGCTGTTTTGAACAT^ 
A(^ }TCC}" ^ATTGGTTCGGTTATCGAAGATGA ACC^ 

GA'GCG<}CCATCTGGGGAACACGCGCAGCTGTGATTGGTGGCGCCAA . 
GQ' AAGi :TGTTTGACATTO:TGTtTTGGGA 
TG/ ACQ' TTCi^GGGTTACGCTGCGACCCACAAAAATTCTGTC 

GG> tidat rTGTACCAGCTGCr ATTCAGGTGGCGCGTOAGGTG«3TGATTAGATTAA 
G A r TGA* ITCTQGGGATATTOCCTACATTTCTAAG 
< GA< iGCft AOAfTTATGCTTtnAATGATTTG^ 
(T* AGJ TJTO TGTCTGGGGCGTGGGTACCAAGCTGATTACAGC^ 
x > CA/ GAT*] GTTGCAATCGAaGATGAAACTCGTCAGATGCGCAA 

A A/ AGT( nTCGACGCC^GGTAAGAAGCAGGTGTGGCGCATTACCAGTCGTG^ 

gtc act/ catcacttatgatggtgtggatattagcgacXtgacag A AATCAAGAtGTTCCATCCGACCTA 

TaJC ATAC ATQAAGaAGACGGTTCGTAA 



GAjT GAGjI ATAAGCGTGTGCTCA'ATCCGCAGCACTATCCAGTXiGATriXSGCGCGTCATGTATGGGAAGATA 
AGA TGG> CTTGATTGATAAGATGCGCAAGGAA 

2CFf:r I'homologu^ of SEQ , ' ID NO. 7" 
ATG jCTA-TATfCAATGGTITCCTGGTCAC^TGTC 

TTG' TGA! TTTGTGACGATrrTAGTAGATGCACGCTTOCCTCTATGTAGTCAAAATCCTATC 

» •« * ■ .... 
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. V 



ATfrGTTbGTGATAAACCAAAACTC^ 

KA tggc QTCA'GTATTTTGAATCACA aggaatccagacgct 

a/ AAq: tgtaacagatgcggccaagaagctcatggct^ ' 

TC AG fid rOAA ACCTTGGGTACCATG ATTATCGGGATTCCAAACGCTGGTA AATCCACTCTGATGAACCGT 
TT jGCT SGTAAAAAGATTGCTGTtGTTGGAAACAAGCCAGGGGTCACAAAAGGTCAACAATGGCTTAAA 
_«\ AC CAAl AAAGATCTGGAAATCTIOGATACACCXiGGGATTCTCTGGCCTAAGriTGAGGATO 
W CA CTTA AXJTTGGCATTGACTGGAGCTATCAAGGATC&GTTGC^ 

A1 GAAT rAT^AAAGAACATTATCCAGAAAAGCOXSGCTGAACGCTTCAAAC ■ 
GA AGCC CCTCTGATrATTATAGATATOACCCGC^ 
TC rGTTX IGTGAAGGAAGTCCGTGATGG^AAACTCGGTAA^ 
•GCCAACGA' 



rGAAC 

L-nkA 



2CJF1E8 "homologue of SEQ. ID NO. 8" 
ATpATrkACAATGTTOTACTrGTAGGGCGTATGA<^CGTOACGCTGAGT^^ 

TAtKSAGnGCtiACTnTACTCTTO^ . 
TT TATC IAATGTGGTTATGTGGCGCCAACAGGCTGAAAATCTTGCTAAC 
- At 2QGQ 3TGACAGGTCGTATCCAGACTGGTAGTTaCGATAACX:AGCAAGG 

v '2\ GA 3GT<C STGGCTOAGAATTTCX^AATGTTGGAAAGCCGTAGTGTG^GTGAGGGTCACACAGGTGGAGCT 
*£ TA OTCTOCACfliAACTGCAAACT^^ 

CAtTrG/iAGCAACAAACCCATTGGATATTTCAGATGATGATnACC^ •■ 

* t i • • i . * 1 * " • • 

* 2X^ll9 u homoljc>gu0 6ft SEQ. ID NO^ 9 JI * * \ : . 

AT< jAAAliOGC&TATTACAGAATTATTGA^^ • 
GT< 5ATCJ< jTGA'JtXTGGCAGCKSGCTGTTrc 

G\ VAGA ^GrTGTCAACGCCAATATTOATAAAATCAAATCATTG . 

AT< iCTC: TATCtCCCiriXSTGGAAGATATW^ 

A<* 3AQ<p IGGAAATCCAAGCAAGTATATGGAACGrTTCCATGAA 
^ . Cd Aplj< rTCGGTTTAGCTAAACGCATGGA 
^ QC GGG JGGcXtATCGGTAAATTAACAACC^TGAC^ 
v. CtC rTTAil TGCnWAGGAGGMTTGCGGATWTGAAGGTGCTGCGGCTGGCTTf 

£P> Tdl ACA< KHPGQGGA^ . 
^ TT} AAA A&C^GGOATATTGACACrAOT^ * 

ATC AGT1 GACTACJaOATTTTGAACTGGCTGAAAA AGATGCCITTAAG * 

GT1 TGA/ .CAaATOGGAGCAGGTGCOCTA^CaAAGCAGTTOTTCAGGGTO^ 

ATC iGCAiGTCAAATCGCAGGGCHTGTTlfcTAAA 
. A^( >GAGCXX3CTAAGAAAAra . ; 

5>CKl0 "hpmologue' of SEQ. ID NO. 10" • . - 

ATC ATGbATATTCAAGG AATCAAAGAAGCTCrt 

GO/ ag^gagcGaGgataccattgttgctatcaaaaatgtgagcatt^ 

TTT ECT<p. VATAGCCAGTTATGCGAGGTGTTCTGATTArGGAAGCCTTGGCGCAAACTGCTGGTC 



. gttUtcaJaaacctga^aTaaA^ 
^ gca agti gtaccaggcgaccaattggttatgacagcgacttttgta^ - 

GTT GAaC CAA AGGCTGAAGTGGAtGGC^AGGTTGCAGCCAGTGGTATCC TTACrrrrGC^TiGGGAACT 



AA 



I 

I 



2CJPjEll|jrhomol6gue of-.gEd. ID .NO. t.J " • 

ATG ATTA|ATCAjVaTTTATCAACTA ACTAAG^ 
ACCAAGAGAAtPATATCCTTATCCGTCCCAACTA 
^ . GGG AAAtACGXGATCKX^AGATTTTGlAATAA^ 

to CGjT 2ATT rCTGACrCGACCGtiAACcrACGAGGTrrGGTCAAAAAGtTC 

A$C [TAGA QTGATGAAGAAfTCTATGAAAACTACATCAC^ 
^ GGTITATIJAGA SAGnTGmCTCTCCCTAAAG lATCGTGTGGTGGCTTATGATGCnrATTGAAGATACOGTT 
"V GCA 3CCA TXAC. VQAGmGTCAGTCTGGGCATGCACGCTATGAATTOTCTATTGACtc^ - 
AGG SQGA GCGGATCGCC&TtATrCKJAGA 

TrT<j»CCAyAAGQAGAQATTGTQGTTAT^ ' 
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.2 CF& I* 



^ TA aAC/ caGCCTTTAAAACAQTGTITAAGTjGGGAAGTATAA' . 



2 S^d^A^^ 
£ ^^^TCCCAGGGAA AGTCGCCCTTCAATTK5 



cc 



<NC> 



AACaGC 



✓v-tr^Tr' -otgACTATATC^ 

cg&aSa^t^^^^ 

TAACGGXGAdAGTO^C^^ 
AA VAAC GACdAGCCC^CTGG^ 

CA, iAjTA rGAGCATAATACCTATGC, 




QAfrCTO 5ACTXTCGTTTQACAAAACTGGTTGAGTTGACCAACAATCGCTCTC 

U. TTTCC^ QGTCCCGATTCGC^ 

GTSn^&TCGGTGACAAGOAATCTACCCTTGTCTTCATr^ . 

c£ ^laa^aTCa^cSagcacc^ 

GG S^aStAGCTCG^^ 

AC S GGGTG^TCATTCTGAAATCGCTCGTC 

2a SaUcgagtaa^ 

.' ^SpScTTATA^CATGGTCGAATTTC 

. TGMCTAA- . •• • . . 

T^ATG^TAbCT^^^ 
Jol » ^ TCACC^ ACATCGT^ 

TCC Gt GC iTGGTCAAGACTTTGAACAAAGTATCATTOCAGACGACCTAC 
\t, 7 ^TCCAAAACGACGGtGTAGTTCTGGCTATCTGCGGTGGTTTCCAACTATTC 
G^A^CnCAGGAA^CGTAtCGAAGGGCTAGGGG 

A- 

CG/ A GAL/ l AAC50TCGGTG^M.V-7VJWV' A a wti tm« r»4^^-* — - «v— : - : . . * - 

TCT CGTA ATGCfcAATCTGGCTTATCGCCTAGTTACTACTGCC^TCAAGAAGAAATATGGTCAGGACATCC 

AAC TCCC tgcctatgaggatatcctcagcc^Agaaatcgctgaagagtacagtgacgtcaaaagcaagg 

CTC ACTT ITCTtAA " . . .' 



2CFtW , ■ . • ... 

ATjaAATdTAAAAGAAAATAGADAACTTGTTTTTCGAGAAGTIGCAGAGGCTAGT^ 

AGA^TGC 'TTCG^lCTCTXiTCATTGCAGTTATGAAGTATCTAGATG 

• GCTAOG1 GTTCATCATATCGGTGAAAATCGT ^bTAGATAAGTTXCTGGAAAAATATGAAGGTTTAAAAGAT 
CGA OATC tOAcH^CAtmSATTGGTACCTTGCAAAGACOTAAGQTGAMGATGTCATTC^ 
A 1fr*TTT<5CATOCATTGGACTCAGTAAAGC 

AGT 3TTT XTTGlAAGTAAATATTTCTA^ 
^ AAT :TTG jC AGHGTTAGCCAGACTAGATAAGArTGAATATGTTGGTTTA ATG ACGATGGCACCTTTTGAG 
X GCX \GCA GTCABCAGTTGAAAOAGATTTTCAAGCCGGCCCAAGATrrACAAAGAGAAATrCAAG 

CAA ATrcCAAATATGCCTAtGACCGAGTTAAGTATGGGA AT GAGTCGTGATtATAAAGAAGCGATTCAAT 
TCG^TTCCACTlpnr^^ . 
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2cklS "homologue of 1 SEQ- ID NO. 1 5^ • . 

\' ^gIuvTTGGTCTAGAAAATG 
G< iATG rn'CITTGACGATTOiiAQATGGCT^ 
AC TXTi TTACA aCTCTTAAATCWTTT ATTGGTGCCA AQTC 

Cj !cCTC G ACTTCTA AA AATA AAGAT ATTCGTCA AATTAG AAA ACAGGTTGGeTTGGTATrrCAGTTrGCT 
GJ XL rCAGATTTTTGAAGAAACGGTTTTGAAGGACGT^ 

A> lGAa 3ATGCTGTGAAGACTGCGCGTGAG AA ACTGGCTCTGGTTGGAATTGATGAATCACTTTTrGAT^ 
^ tagtcc GTTTGAGCTGTCAGGGGGACaAATGAGACG 
TC TATA TTAGTCTTAGATC AQCCAACAGCTGGTCTAG 
TC AAAU AACTCCACCAGTCACGGATGACCATX2GTC 
TC CTA/ TCAAOTCTATCTAATC 

AC acg' ^tttttatggJ^qaagttcagvtgggagtA(^ 
QJ taBj ggcotgtcatitaaacgattaccgatta^ 



(l 



II 



2CTlE16» '"homologue. of' SEQ. ID. NO. 16 . 

at gSa^attcIIatttitaggaacgggggctggtc^ 
TC AJCAC tcttogatgagattaAcgaagtttggct^ 

OC AAAOCACi&TrCGACCACGT^ 

gc the scagdtttcctttctAgccgtgcct^ 

OA CCTC \AGGA ATCAAGTC ATTTOTCTT^ 
^ Ca rtTC ^TGAGTTTGACCAAGATTCtCTGGGTAA AATTCTTGAA ACCGATAAATTCACTGTGTATGrCAGA 
• GC AGCI GG AOCACACTATTTTCTGTGTTGGCTATCGT6TC ATGCA A A AGG ATCTAGAAGOGACGCTGGATT 
^ qq xga^ A AACTC A AGGCTGCTGGTGTTGCGTTCGGCCCGC l l i'i'lGGTAAAATCAAAAACGOCCAGGATC 

Cr ttgtti}' t>gaAgacggaactgaaatcaaggcagcagactatatctcagcgcc^ 

^ CA :TAT rTTAGG AG AC^CTCGAAAAACGGGTGCCAGTCT 

CA rGAG rCCACTTATGGCAAGGGTdATCAAAAAATTGCTCGtAACCATGGTCA 

crl 3CAG ^AGTAGCGGTAGAAGCAGGTGCCA AACGCCTCCTACTCAACCATATC AGTGCCCGTTTCXrrCTC 
AA ^a!g/ TATTAGCAAACTCA AGAAAGACGCTGCCAC AAtTTTTGAAAATGTC^TGTGGTGAAAGACTTG 
G*UGA*GTGGAAATCTAG ' •'. 

^GFC17. ''homplogue of SEQ. ID NO. .17" , 
A?i 3 AGtJCaTA3TCA<3TTTAACA ACACirGGTGGTCTGCGTGAGAATGGAAAAAATATGTACATTGCTGA^ 
TT( rGAG- ^GTC^TTiTTGTtTTG 
GG' X>AT XSCAj^CATGGATTACCTTTTTGAAAAT^ 



AC< :ATtt JAGTtPGCAAAGCTCI^^ * 

TTC ATCW XjAATACGGAGATlft ATTTTGGTGGG 

GA< SAGT ;TGG&AATTGTCTTGAAGAC^tCGGAAGG^ 

AAj lCG<3 ;TAGTGMTCrrATGCAACTGATTTTGCTCGTTTG 

TCT CCTC \GTG ATTeGGCCA ATGCAGACAGCAATATT^ 



ATdGTCi&CAOAGCGATTCGTCTTAAGA^^ 



TCM GCA< ATTTTTGACGCTGCGGATAAAACAGGTCGACGTATCGTCnTGACAGGATTTGATATTG 



AT> TGTCTCGC^rroAAGACCATGAGTTGAtTATTCTTQAGACAGGTCGTATGGGTGAACCTA 
AG! TCGT /UGATGTCGATTGGTCGCCATCGTTATGTAGA^ 
Cp^ ACC GGTCCGTCt AtTGCTAAAGAAGCCTtTOTTGCGGGTGTGGAAAATATGAT^ATCAGGCAGGTGGGa 
^ TTG FGAA ATTCkmCCCAAAQTTTACATGTATCAGGGCACGGA AATGTGCGTGATTTGGAGCTGATOAT 
CA* TCTT rTGCAACCTAAGTACCTCTTPCCCt GT ^CCAAGGGGAGTATGGTGAGtTGGATGCTCACGCTAAd 
GCT Gqcjv TGGcSGTTGGGATGTTGCCiAG A ACGCATCTtCATTCCTA AAAAGGGGACGACCATGGCTTACG 
AG/i ATGC AGACTTTGTT^AGCTGGATCGGTTTCAGGAGGAGATATCITGATTGATGGGA^ 
TGA IXjTT jGAAATGTTGTTCTTCGTGA<XGTAAGGTCTTGTCAGAGGATGGAATTT^ 
CAG TCAA CCGTi^TGAGAAGAAAATTGTGGCTAGAGCTCGTGTrcAGACGCGTGGA 
. GA!A G AG1 CGGGATATTCTCCGTOAAAGTTCaG AATTGATTAACC^ 
AGA TGAC mGACTGGGCAGATCTCAAAGGTAAGGT^ 
ACC VAGfc arCGpcCAGCC^TTTTACCAGTAGTCATGGAAGCAAAATAA 



I 



r 
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AT fe A CrfAAACkAThc*TC^^ 
AA GGT/.TCTACGTTGATGCGACn^ 
AA AAG< CCATCTCTATGCCTTTGACCAGGATC^ 

TA CATT 3AGAAGGGA ATGGTGACCTTTATCAAGGATAACTTCCGTCATTTACAGGCAC^^ 
CT 3GTG nCAQGAAATTOATGGAATTrGTTATGACrrGGGAGTGTCTAGTCCTCAATT^ 
OC 3TGGTITnCfrATAAAAAGGA 
TA rGAA GTGGTTAAtoTTATCACTATCATGATITGGT^ 

CT 2TAA \GAGATTGCGCGTAAGATTGAGCAAGCGCGT^ aAGTGAAGCCGATTGAGACAAGGACTGAGTT 
V AG ^G/ GATrATCAAGTTCGTCAAACXTGCCAAGGAACTCAAGAAGA 

Vlr TT "CGAi jGCTattcgaattgaagtcaatgAtoaactggJgagcggcagatoagtccatccagcaggctAtq 

* CA rATC rTGGCTCTGGATGGTAGAATTTCAGTGATTACCTTTCAT^ 

AT rGTT( :aAGGAAGCTTCAAC AGTTGAAGTTCCA AAA<JGCTTGCGTTIt^TCCCAGATGATCTCAAGCXX5 

AA 3ATC GAATiTGGTGTCCCGTAAGCdAATCTrGCCAAGTGCGGAAGA 

CA 7XCA 30C/J\GTrGCGOG^GTGAGAAAAATTCACAAGCXCGAGGACCACC^ 

(i CF £21 "homologue of SEQ. ID» NO. ^9" } - *, 
AT jAGTkGAATTTTAGATAATGAGATAATGGGGGA 

TA' TTA toTGA ATATATCGGACAGG ATAAGGTGA AGG ACCAGGTAC AA ATCTTTATTGAAGCTGCCAA AA . 
TG* TATfccCAACGAACTGGGAG 

art jgta* scTA'rma aatg agttagagcctggggatgt^ 

Gf < JACK iGAAbAjGGTGCTrtATAGTGCTAirGGAGG 
^ Gff :AGT :GTAQt6TTCATTTGGAGTTAC 

er< :caa' 'ccgcjtacgggcacgttttgggattacaggc^ 

JSC GXAATK HXXl^GCGGACGGGAGATATTTTTOAGATGG 
1>'TA< JGXA* iTCGTQGGACCCCTCGTATTGCCAATCGTCIX^TCM 
> GG< 3AXt jGGGTAAT^ATGaTATTVVTTACCGATAAGGCT 
T0< >ACTV JKSTGGATCAAAAAATfiCTTC 

Ai< Ten tctgtgaatatcgccgaagagcgtgagaca^ttga . . 

aL ^ggt ttat catgcggacacggtctggacgggtggcgactgct aaggcatatgagcacttaggttatg 

A A* ACA<>TGAAAAaGCGGCCGCACTC0AGCACC^ 

. "homoloque of SEQ. ID. NO.. 2D'? 1 

ATC AGfATGTnTTAGATACAGCTAAGATTAAGGTCAAGGCTGGTAATGGTGGCG 
' TTCQTCGTGAA^AATATGTCCC^^ 
CTT CGTT inAGACGAAGGACTACGTACCTTGATGGATT^^ 

GTC AAAv lAGGQATGACCAAACKjGATGCATGGTCGTGGTGCTGAGGACCrrAGAGTTOGAGTACCAC^AG 
• G1A CQAC TGTT]<?GTGATGCGGAGACTGGCAAGGTTTTAACAGATTrGATTGAACATGGGCAA 
CGI TGCC CACXKJTGGTCGTGGTXSGACGTGGAAATATTC^TTTCXJCGA 

ATC TCTG AA AATGGAGAACCACMSTCAGGAAGGTGAGT^CAATTGGaACTA AAAATCTTGGCA . 
V> GC1 TTAA TAGGATTCCCATCTGIAGGGAAGTCAACACTtTTAAGT^ 
^ TGC TGCCTACEACTrTACCACTATTOTACCAAAT^ 

CAG TAGC CGACTTGCCAGGTTTOATTGAAGGGGCi AGTi^GGTGTTGGTTtGGGAACtC^GTTCCTCCO 
W- TCA CATC GAGCGTACACQTGTrATCCTTCACATC^TTG^ 

AGC ATTA CCTAGGTATCAATAAAGAGCTGGAGTCTTACA ATXTrTCGCCTt^TGO AGCGT^ACAGATTAT 

TGTXACTVaTAAGATGGACATGCCTG^^ 

AAA TTAT SATGAAmGAAGAGTTACCAGCTATCtTCCOA ATTTCn«GATTGACCAAGCAAGGTCTGGCA 
ACA CTtT :AGATGCTACAGCTGAATTGTtAGACAAGACACCAGAA 

TGG A AGA AG AAGTTTACTATGGATTTGACGA AGA AGAAA AAGCCTTTGAAAWA^tCGTG ATGACGATG 
CGA 2 ATG GGTA }HTTCTGGTG A A AAACTX:ATGAAACrCTTTA ATATGACCAACHTTOATCGTG ATGaATCT 
GTC, VTGA \ATT toccCGT^GCTTCGTGGtATGGGGGTTGAtGA aGCCCTTCGTGCGCGTGGaGCTA AAG 
ATC SGGATTrGGTCCGCAtTC^ 
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ICltasi "bQm,Qlp.gye. pf.SEQ. . JJ) A .NG*. 1 y 

AT SAAC TACTiTTAATQTTQGGAAAATCGTTAATACGCAGGGATTACAGGQTGAOATGCGA 

TG ICGC ATTTTGCAGAAGAACGGTTTAAAAMG^ 
^ > TG TGCA AACAqTGACCATCGCTAGCCACCGTAAA 
^ TA :CA+ VTCAATACTATCGAAAAGTACAAGGGATACAGTCTG^ 
{-^ CT ^GACSATGkjTGAATTTTACTATCACGAGATTATCG 

Vk OA agc/JtcaajOgaaatcctgCaaccaggtgctaato 

AT r^CimACCTTATATCCCACC^GTGG 

AjF 2TTXG AAGGGTTAGACGATGAA ; , 

t 2CF326 ^'homologue of SEQ.IDNO r . 22" . 
AT 3AA^TTGATATmAACCCTCTTTCCAG AGATGTT^^ 
TCi 3AG AaAaiGGGCTCTTGGATATTC^ 
GA TGATCAGCjteTAGGGAGGCGGTCAGGGC^^ 

CTi -TTGi LAAACfAAAAATCCGCGCG^ATTCTCCTCGATCCT^TG 
^ T(5, lAjQA rTTO^CTCA AGAGGAAGAGCTAATCTITATCTGTGGGC^ 
fca AA3AGCrTGGTAACAGATGAGATITCCCTAGGCGACTATO 

C^ATOA ITGAWCTACAGTTCC^ 
^ TT TTC7 TCAGGTCTTTTAGAATATCCTCA 

AH iTA'tf K3ATGAGTGGTCACCATGAAA AGATTCGTCAG^^ 

C& U3CG :AGGCCGGATrTACTTGA ACATTATtAACTGaCAGTAGAAGAA 

CaAAGA ^AACAAaGAAGCGGCCCK^CTCGAGCA 

i *i » * \. . . 

' 2*C^^ "hdmolpgue. of SEQ. ID NO. 23" * . " ' ' 

. . AT< S ATt<S AAGCAmAGTAAATTA AAAGGTGGTATGACCTTTCAAA 



CTCfGTTC rAGATTTGACACAAGCTACCGTCCAGAGGAAAAATTTCAACAAGGt 

AG< :TCAi .t AcmsiAC AAAAT GGATGACACAGCATAGTTCATG A ATACAGAAACTTATG ACCAATACGAA 
ATC :CCTC TAGTpAATGTTGAAAACGAATTGCtTTACATCCTTGAAAACTCTG ATGTGAA AATCCA ATTCTA 
t *^> C<KiAj/VC GAAOTGATCGGTGTC^ 
l >. ATCAAAOGTGCTACTOTTAfcA^ 

CAC iAjEr CATGGAAGCAGdACAAAAACTC<3TTATCAACACTGCAGAAGGAACI^ 

cg>,gc^<x:Accaccaccaccactga . 

. ^yi28l"hpmologue of SEQ.. ID NO . 2,4 " 
. ATC GC^+rtxL^ 

. CTC AATC TGATGTCCAAdAGGd^CCAAAGAAATTCGCTTGGCCTTpCTC 
TGI TQTA AAGGACTTTATCA AGAA AGTTAGtGAGCGT^^ 
CCI GCGC AACAGATTATTAAAATCaTTGATGAG 
TTA IXjA^ GTCA^CTAAGATTCCAACC^ 

TGC TGGT AAATTGGCCAACAAACTCAAGAAAGAAGAAAATGCTCGTCCTTrt 
TAT CGTC C^GCTGCCATTOACCAGCTTAAGACCTTGGGACAACAGATro 
A AC AOA> <3T AdCAGCTGTTGAGATIOT^ 
GTC riGA TTOAfrACTG^ 
<^ cat rqbc K^CCAA ATGAAATG^GCTTG 

GCC TGAC mAATGCTCAGTTCGAAGTGACTGG^ 

M , ggt gctgctct otctgttcgtc^^^ 

rp CGGACATrGAAACCTTCCACGCAGACCGCATGTC^ 
^ G^T TGAC AAAGfcTTCTCAGGAATACGA^ 

CAE ZtTt< 3 ATTIJ TAATG ATTTC ATOG ATCA ATTAGAt CAGGTGCA AAATATGGGGCCGATGGAAGACTTG 
GTCA^GArGATJTCCAGGTATG^ 

-GOT :gta ^ACGTCK^ATTGTGTCTTCGATGACACCTGAAGAGCGTGAAA 
GCfc 3TCG DCGTCGTATf GCTGCTGGTTCTX3GAAATACATTCGTCGAAGTCAATAAA 
. AACSAGGCTAAACAGCTCATGC^GCKSTC^ 
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f< ;ATC 3GACAAGGCGGTATGCCTGACTTATCAGCTCTCqG AGGAGCAGGAATGCCAGaTATGAGCCAGA 



5 .'^ T< 5TTT< I GTGGCGGTITGAAAGGTAAAATTGGTGAATrrGCCXTGAA ACAGTCCATGAAACGTATGOCTAA 
\§, V ^ pj 'iAAA TG AApAAAGCGAAGAAGAAACGCAAGGCGGCCGCACTCGAGCACC^CCACCACCACCACTQA 

i-C 7 ES9 homqlogue of SEQ. ifiD NO. 25" 
A :GTAtCTTATTGAAAm 
r CCA< rTAfcAGGTCACTTGATTTTAG 
rOCATGOTAATGTCGTGA 

C( TTTl AAAOfcGACTAAGGACAAACAGGAAGTTCGTAAGACTTGGAGACTATGGTTG 
C( !AClh TGCCTTTACTTGGTGTCTT^^ 

n gctc tcatIottgattatctagggggttgccitcatctatttgg 

to Gj iGCCi iAGTGTAACAGAGTTGGACAAGCTTCCTTATACGAC 

. fc craTTTACCAGGGACTAGCeGTTCAGGTGCAACGATTGTC 
VlS* CJ .GTlt TGAGAGAATTTACCTTCTATCTTGGGATTCCGGTTATC 

V ^ L Am GTGA AAGCCGGAGAACTCTTGAGCTTrGGGCA ATTCTTTITGGTCTrGGTCGGGAtGGGAGTAG 

ci nrr<S sggtoaggatggtggctattcgcttcitgaccag 

<K rTAA!^ lTACGGTATCGTGCTTGGTAGTGTTTTCCTACTTTACAG 
C/ CCAC CACCACGACTGA 

' i . : ' ' 

: ■ . I ..-. ; .| . ... * * ' * * 

2 Cf{e30' '"homologue of SEQ . ID NO.' 26^ ' 
AT GdGWTTATTTGACCGTCTATTCGGAAAAAAAGAAGAACC 

Ct 3G AA AATCTTGATTTGTCTGAAGATGTTG ATCCTACCTTC ACAGAAGTTGAGGAAGTTTCTCAGGAAG 
AA GCA< AGGTTGAAATrcTTGAACAAGCTGTGTTCCAAGAAGAGGAAATCCAAGA 

GT CTG$ATTtJW^ . 
. GA ATAC TGAG1TTTCTAGAGACTATAGAAGAAAATAATTCTGAAGTTCTTO 

AG AAG/ ^AACGGTTCAGGAAAAATATGACCGCAGTGTTAAGAAAACTCGTACAGGTTT 

GA ATCC CTTCrTTGCf A ACTTC*Xk:TC^ 

TO KdlG ATGTTGGTCTCCAAGltGCTTCTAA 

TG :CAA 3AAACCTG AtGCACTTC^ 
VX AG 7TAC jATGAaAGCATCCACTTXT AAGATAACTTGACAGTTATGCTC 

. GG \AAA CAACTTCTATCGGAAAACTAGCCC^CCGCTACAAACAA 
r^V CX 3CA(fimcblTCGGTGCGGGTGCAGTAGCTeAGCTAGCTGAATGGG 
^ AG TAAC rGGACCTGAAAAAGCTGATCCAGCCAGCGTGGTCTTTGATGGTATO 

GG" TATCi }AT ATTCTCATGATTGATACTGCTGGTCGTCTGCAAAATA AGGATAACCttATGGCtGAGTTGQ 

AA. VAGA TTGGtCGTATTATGAAACGTGTTO^ 

TX?> ,AO*< 5GTCAAAATGCCCTAGTACAGGCCAAAGAATTTTCGAAAATC 
TXJi- kCTA!> IGATTGATGGAACTGCTOGAGGAGGTGTC 
AA< lATIf< 5ATTG<5TTTT<K3TGAAAAAATCGATGATATTGGAGAG 
CTC TTG* AAGGTTTAATCGCGGCCGCACTCGAGCACX^CCACCAC 

,1, « , * * * * • 



0.CFI 



31 



homologue of SEQ. ID NO* 2 7" 



atotatAttgaaatggtagatgaaagtggtc^ 

TGCAACTTOC^GCCCAAAAATTaGGAAAAGAAGACA^ 
GT/ GTG/ XGAACTTA ATGTGGAGTACCGTGACACCGACCGTGCGACA 
' ACC AQa\i kTTGGAAATTGCCTTTGACGAAGAGGATTTGCTTGAAAATCCAGAAtTGGGAGAGATC 
V «^ GaC ITTTC ATGqCTATATTGGGGAATTGTTCATCrCTATCG ATAAGGCTCATGAGCAGGCCG AAG A ATATG 
H GTC ACAC CTTT BAGCGTGAGATGGGGTTCTTGGCAGTACACGGCTTTT^ 

TAT ACTC CGGA AGAAGAAGCGGAGaTGTTCGGTTTACAAGAAGAAATTITGACAGGCTATGGACTC^ 
AG/ CAAf. TCG^jjCACCACCACCACCACGACTGA 



I 



homologue' of SEQ,. > ID NO. 28" 



..ATE AGTA|TTCG^GTAATTATTGCaJGTTTtAAGGGA AAGATGGGCCAGGCTGCTTGTCAGATGGTATTGA 
V . CTGATCGAGACffTGGACrrcGTG 

. ' GTT ^/{AGGAT AAGGCTOAmAGCTGGTTITGAAGCWGATGTCTC 
^ TGC ?TA(jj5AAAATACACGTTTTGCTC^ 
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^ A^AA(!}AAAlfTGCAGAGCTAAAAGAATTTTCTCGTGCCCAAGACTTGGCT 
« TC CCTIGGGTQCTGTCTTACTCATGCAATTTGCGAC^ 
^) fj GAG( TCCATCATGACAAGAAAAAGGATGCTCCGAGTO^ 

m C iQ A G(iTrCGAGAGTCCATrCAGCAAGGCGCAGCAGATGAGGAAGAGCTGAtTG<^ 

O.T CiACI TTG ATGGTATG CG CATCC ACTC AGTTCGTTTGCC AGGCTTGGTAG CTCATCAAG AAGTCATCTTT 

. g> iCAA" ^AGGGAGA AGGGTTGACOCTCCGTCATGACTCCTATGAf CGCATCTCX^TTCATGACAGGAGTCA 
3 tTOCJGAATTAAAGAAGTTGTCAAGCGTC^^ 
' QCACC^CC^C^CCACTOA , ! .' . ; i 

2.CSZZ31 "homologue of SEQ-j. XD. N(X--29." < . i 

& GC AGCAGTOAAGCTGTATTTGCAGCAGTAGCtGAGTATCTTG « 
>iv GT T-rTGOTAAfcTTTSAAGTTCGTGAGCGTGCAGAACG AT 
\^ CAATjTGCA^CTTCTAA^GTACCAGCATT^ 
jn ^CAqCACC?ACCACCACTdA 



Xlft/cikAAACA^^ 



CA n2GAirACGGAAGAAGACAAACTCAATCAGACCCGCTATACGGAACCAGCCATTCTAGCGACTTCGG IT 
GC IATC rACCQmATTOCAAGA AAAGGGGTATCAGCCTGATA^ 
CP ?TGC ;TTGdTGGC^GCGGCGCCTlGGATTlTGAAGATGGGGTTGCCTTGGTAG 
TA" TATG JAAGkAGCGGCTCCTGCTOACXCTGGCAAGATGGTAGC^GTTCTCAATACGCCAGTA " 
TT< }AAG ^AGCCTGrCAAAAAGCTTCTGAACTrc 
AA rOGT ;ATTGCTGGAGAAGTGGTTGCaGTTGATCGAGC 
JH, CG TTG- ^TTCCTCTTAAGGTGTCAGGTCCCTITCACACCTCTCTCCTr 
Vfc TG, U&d]rCTG^TCAGGTAAOT 

TCCGGTVAtUUAAUAflW-AUVW\lAAWvn«w 4 » » w W - — . » . " www. w.w - — - 

Gr AAA. VAAATTGATCAAACTGCTGACrTAGCTCATGTGGAAGATCAAGCGAOTTTAGTAGCACTTTT 
AAAAACrCGAGCACCACCACCACCACCACTGA 

Qcil'SS . "homologue of. SEQ*i- £D NO.. 3^ 35, j • t \ , -j ,. 

AKikAACTXGAA^TAATVAATATCTTTATTACApGTTCG^ 

A<y TTGC ^CAAGC^GGAGCqAAC^TTGTwTTAAACAGT^O^G r . 
3fl TTCA AACTATGGTATCAAGGTGGTTCC^ATTTC^ f 
AS$Ain$ATPMGW 

|CT^^^^A^f^ri{AC T 



fATTG 



(^S GTJGTtCGTTTPATGGGQAAT.ATW r ^- n - - rT - 

Ck/ GTCWTGGCACGCX5AGGTCGCT 

tctJgatatgacAgctatctt>^^ 
" jcagagcaggttg 



GQA 
TGC3 



C^TGAAAGAAJ? 

:Ajlc^5^^|^r ^ 



pfcc ATTp^TGGT^TT^AGT^ 

i * * * 

OCFSiM i "homologue of. SEQ. ID NQ. _32" L 

ATC GGA^TGAAAAAGAAACTAAAGlTflACTAGTTTGCTAGGACTGTCTCTGTTAA J 

CGA CTAATGOGpTAACTAGCGATATTACAGCCGAATCGGCTGATTTTTGGAGt^ T 

GCG GAA> TCA^T^GCTTTTTATCGTTTGATATTAGrATCG^ T 



ttaa qgcgCttcgagaacaatatccaggtcgagatatgga aagcagaaccaaactagagcaggaa K 

■ rt»v 3TAA(aGTA nTAAAGAAATGGGTGTCAGACAGTCAGACTCTCTTy^ 
- V^GGTrATT !TGGCCCTGTTCCAAGCCCTATCAAGA^ 
ACCrTOGTAGTGTGGAtACAACCC^ 

TTG" "CCA, ICAMGCTTtGTCTGAGCGAA ^ATGGCGCTACGACTGCGATGATGTATGOGATTCCAGTCTTGa 
TTT! TAT< TTTGpAGTTTATGCGCCAGGTGGAGTCGCCCT^ 
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^ X fGGA *ACCTATTTC7TGAXta ATCCATTCAAGAT^^ 
t "Q \ r h AAGATITGGi^AATAGAAAAAGAAAAGCCAAGA 



C U2CA 



tCACJtoA 



2C ?E37i homologue of SEQ. ID NO. 33" 
A' CAAbAlTAGTAAdAGGCACTTATT^ 
T GAT GTGOTCTATTCGA 

A, VACG VAGGlAATCTTTTGGATTtjGTAGTTTGATAC 

IX JAC A ^TGAGCGACTAATCATTTTAGTTATAT^AAf AGA AATGCIT^ 

G< }TAT TCAGrAAAGGGGGCATACGdTTGGATTTCGGTTGCAGGAGTAACT ATTCAGCCAGCiGAGTACT 
TV AAA. ITCACTATTATITGGTATTTAGCTCACCGATTCTCCAAACAGC 
r TTCMGTnnGACTCAAA^ 

& fl*rc}< i AAGTTTGGGAATTTTCCCTGAT^AGGAA A 
A/! ACA( iTTAGTGiOAATCGCTrATCGCTGGTTTrc 

^ TC TTG/ CCACTATCAGCCTAATCGGTGTTGAGACCTTTICAAAA ATTCCAGTATITGGCTATGTAGCeAAA 
^ CC CTTI AGTGCCTirrTTAATCCTTTTGCCGATCGTGCTG^ 

y£\ G< X^ATC GTCAATGGTGGTtGGTTTCGTCTAGGTCTTGGAAACTC 
v > U >GCT( ATAC^ACTTTGTCTTTTCrATCGTGA 
CI CtCT rGrn^CATGATTTTCCGG AT^ 

TT OCAC rCGG^GTCGG AGGG ATGATGTTCGTTCAGGTATTTGTC^ 
CGATOJ A.CAG6AGTAACCTTCCCCTTC1TATC 

AC CCn rGTCTTAAATATTGATGCCAGf GAAAAACGCGCTAAGTrGTACCGAGAAITGGA AA ATCAACCA 
Al OAAjC CTTCfroTTGAAGCTCGAGCACCACCACCACCACCAGTCM 

2CFE38i "hpniQl.Qgpjs.of .SEQ. ID NO. 34" 
. AT GCrcbGA AlTTTTA ACCTTTAITC 
CT !TpC :AAG AA ArcAGGGATTTTAGTACGra 

TT 3GGA ^GGXTGGAACG.GCCTATACCATTCGA ATCTTGCCTdTXSGGTCGCTATGTCCGCATGGCCGGTTG 
GG OTO-A TGATACAaCTGAAaTCA AGACAGGAACGCCTGTTAGTTTC^ 
AA AGGC ATCAATCTCTXlAGGTAAAAAATrcGATCAAAi^ 

TT< 3AK6 kCA AGCtCTTTATCAMGGATTGGTTCTGGAAGAAGAAAAA ACAm 
. A*. wGGTtrGTGCJMGCAGATGGTACTGAGGrrcGGATTGCACCXTTAGATGT^ 
AtD ^GGGGCAAAOTGATTACCAATTITGCAGGTCCTATGAACAAT^ 
GG tTTTi LATC^ATGCAGGGTGGTGTXUGA^ 
V) CC' TGG< ^AAdGTAOGAGTACCAGAAACGGCAGAAATTACCAAGATTGGCTCAGATO 
GG 3AAA GGTTGATCCAACk^TOTGGAAACAGAAACCAAAGATAAGA 

Vfc m CTG'/ AAAGGGGAGTGACAAACAAGTCACTGTTACACCCGAAGAT 

TO: TC>y GCG<^GGTTAAGTCAGATITTCrAT C^ . 

CTC JTCX3C AATTCTCTCAGGTCTGAAAAATGTGATrrTCCAA 

GM7 r ATCT TTAAGGCAAGf AGTGATGCTGCTAAAAATGGAATTOAGAATATCTTO 

TTI CCA7 CAATATTOGGATTTTTA AlXSTTATTCCGArrdCAGCCTTGGATGGTGGTAAGATTGTGCTCAAT 
AtC :CTACMGGCATCCCk:CGCAMCCATTGA AACAAGAAATTGaAAGCTATCTCACCTTOGCCGGaGTO 
GTXA^C>TGGTTGTCTTGATGATTGCTGTGACTTGGAATGACATTA 

cc/ccacicacoaccactga * . - 

' $JCFE39 % . "honiologue' of SEQ. ID NO. 35" 
ATC TACfoATAmAAAAGGAAT^^ 

TTG GTTA TATCCTGC ATGTGGCCiA ATCCTT ATGCCTATTCAGGTCAGGtTAATCAGGAGGCn^GAriTAT 
GtG CATC AGGTTGT^GTGAGCACGCCGATTTGCriTATGG 
^ TTCtoAGrCTAATTTCGG 

lGOAj 



AAC AGOC CAGdA.GATPGTGCTGGACTTGG AAGGGA AGGtAGTAGTTGCAGGAGATGGCCTTCCTGCCAA 

Vfc GGT CQCA gtgcaagcaagtgctgaaaaccaagaattggaagaagctatggaagccatgttggctctggg 
cta 3aag gcaacagagctcaagaaaatcaagaaattctttga^ 
tat 2aag|rcggcccrtaaaat<ntggtcaaactcgagca . 
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2-CBE40 "homologye of S£Q. ID: NO, 36" . 

a: 'gaa^aar^atcgtattttagcactttctgg 
g( icta( :ctat[accttgaacggcttgcatgggtttctagcagtga 
. gc iath gaag!tctttccaactgatgat 

tc ggg< i aatta ag attggtcttctccctactgtcagtgtggct gagaaggccn^ggactttatca a acaa 
cc ccc/ ggaotacctgtggtgttggatcct^ 

^ ACCJX^GCX^AGAGtTGATTCGCTTTl^ 

. n A TT4 rCCG^TCAGGA AATTAA AACCTTGG A AGACATGAAA ACTGCAPCGCAGAAATTGCATGATTTAO 

GJ GCGOCAGQ^GTCATTATCAAGGGAGGCAATCGTCTTAGTC^ 
H> TG G AC/ G ACqTTTACTATCCTAGAAiAATCCAGTTATCCA AG 

GC CTCl AGCA^GCCAGTGACTTGATTAAAGGTGATAAACTTTTGCCAGC 

TC STTt VTCGTOCTATTGCACAAGC AGATCAGTATGGAGTAAGACAATATGAAGCAAACAJVAAACAACC 

TCpAGC acca£caccaccaccactga . I. 

I • . • • • . i . * t . . 

2.CFE41I "homologue of SEQ M ID NO. 37" I i , * . 

XT 3ArteAAA*CGGAGAAAAAAGAGGAGCGAGTCGTGCTCATTGGTGTGGAATTO 
'7T rqAO iTCTCCATGGAAGAATTGGCTAGTTTAGCGAAAACGGCAGGGGGAG^ 
OA C AA> AaCQ*GA AAaATAT&ATTCCAAGACOT 

TG[G ATGC A£ AaGA AATt^CTACT 
G,A GG A/ GTTCtCGGTGTTAAGGTCATTG ACCGTATGCAGTTGATTTIGGATATCTTTGCCATGCGGGCT^ 
awgCc^ TGAAGGGAAGCTCCAAGTCCACCTAGCCCAACTCAAATACCTCTTGCCTCGCTTO 
CG GAtT kTXTCTCAGCCGTCAGGCAGGGGOAATTCGT^ 

AA CCGT CGTApCGTTCXiCAATCAA ATCACGGA^ATCGAGCGCCAGCTTA AGGTGGTTCAGAAA AATCGT 
GC 3ACT 3TCAQAGAAAAACGTTTGGAGTC T AGCACTTTTAAGATTGGTTTGATTGGTTATACT^ 
GP MAT CAACTATCATGAACATCTTGACCAGTAAOACCCAGTaTGAAG 

GG \TGG 3ACAACCAAGAGT ATTCATCTGGGAGGCAACCTCCAAGTA ACirTGACAGAtACCGTTGGCTTT 
. £*v AT XAA 3ATTJ|GCCGACAGAGTTGGTC^ 
\< TT( :iGG' TCATGTTATCGATGCTAGCA ATCCTrACCACGAGGAGCATGAAAAA ACGGTTCTCTCCATCATO 

AA \GAC ctggacatggaagatattcctcacttcacgctttataat 
C<5 XTAt ^ccaAaggccatataccctcatitctgcca^ 

ATT "GCtj lGATAAGATTAAGGAAATTTTTGAAGCATITACCCTO 

AT *CAT< jATTTAG AGAGTGTTGCAATTCTGGAAGAACGTGATTATCAGGAAGACGGCGAAGTO 

gc aca' ttcgg agaaaa ataaatggaggttagaagaattttatndacctcgagcaccack^ccaccac^ 
ctoa ' " ; • 

«'••'..' - * 

2CM142 "homologue. of SEQj ID NO. 38". 
ATC rGCA<BA AA AAACATATCCTA^ 

AA/ kTTGdSTTGGTCGACCAGAAGTGGtAGAACGGAtTAAGATTO ' 

(UgtgAgtacgaagcagctaaggatgaacaagoctt^ 



GTG 




CO 

\$ 

jfcc 

TC^CATfCCAAGAAATTGGTGAGGACGAAGAAGAAGTTrATATTATCGTAGGTTCAGCTC 
^ CTT TOCA GGTAAGGTTTCAAATG A AAGCCCAATTOGGCAGGCCnrTGATTGGCAAGA AAACAGGT^ 
AGC !A AC< : ATTOAAACGCCTCTTGGTAGCTATGATGTAAAAATCTTGA^ 

GC* ccAx icaccaccaccactga . ' 

" hpm&a.o^ue_'.Qf $EQ. ' ID NO;- 39" 

ATGACC^T^CTTGTAGGCTTGCKJAA^ 

TTArotTpATT<pATCAACTAGCGAAGAAACAGAATGTCACT^ 
^. CCT *GCATCCTTTTTCCTAAATO^ 

^ GGAAMACAGllrCATGCTTT^^ 

TCP PAGkTGGAAGTTqGGAAAATTCGTT^^ 
\^ GTt rATtATTCAXCATAtAGGAACTCAGGTCm 

> GOT fvTGT CAGTTGTTCATCAtPTmGAOTAAGTTroACAGGGATGATrATATCGGTATTlT 
TGA DAAA GTTG f CGATTCTOTAAACTAGTATTTACAAGAGAAAAAAm 
■ TAA -GGa CTCG. jkGCAGGACGACCACCACCACTGA 
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ichii "Jiomologue of SEQ., ID NO,. 49" ,. . . ' , 

ATCATtTrAAfrACAGGGGCAA^ 

AJ^GAAVACGIfGGOAGTAGATGTGGCTGAGAtGGACATTACCGATGCAGA^ 
A; iGXG jTGAAACCGACTTTAGTCTACCACTGTGCAGCCTA 

Gj lAAA SAGTTGGACrrCGCeATCA atgtgacggggacaaaaaatgtcgcaaaagcatctgaaaagcato 

6' 'GCfiii iCTCPAGTTrATATTTCTACGGACTATGTCTTTGACGGTAAGAAACC AGTTGGACAAG AGTGGGA 
A< iTTG 1 / TTC3ACCG ACC AG ATCCACAGACAGAATATGGACGCACTAAGCGTATGGGGGAAGAGTTAGTTGA 
QJ AGC, iTGTGTCTAATTTCTATATTATCCGTACrGCCTGGGTATlTCGAAATTATGGCAAAAACTO 
TI ACCATGCA : AAATCTTGCGAAAACTCATAAGACTTTAACAGTTGTAAATGAC 
1'^ TI GGAflTCGTACCTTGGCTGAGTTCATGACX:^ 
*V TC TCAA^TGATGCGACAGAAGACACAACATGGTATGATTrrGCA 
CC AAG" CaaGCCAGTAGATTCCAGTCAaTTTCCAGCCAAAGGTAA 
Q1 GGCC AAAGCCAAAGCTACTGGATTroTTATTCCAACTT^ 
AV C AAj( JTGaG ACTCG AGCACCACCACCACCACCACTGAi 

2CSE45 ' "homologue of SEQ . . ID . NO . ,4 V.' 

a^gaaXcgttictctcgactctagagtcgattat^ 
ot ggt ggctatctatatagcchstragt^ 

CCCGTGlGATTXJCCrtGGGGCTTGTGATTGG 

TGACCttCTlTCrATATATTTrAGGCTTGG<}ACTTATGATC 

Tp SCAT SAACGGGTGeCAAAAAeTGGGTATeMTAAA-TC 

GA AGA5I ATCCTATATCCTCATGTrGQCTCGTGTCATIG 

CG DACCGTK^GCTGGACTTTTTGTTAATrr^ 
v GC ACTT SAAAGTXJACTTGGGQACGGCTTTGGTrT^ 
v£ GprrTQfTGGAAAATTATTATCCCAGTATTTGTGACTGCTGTAACAG 

TTATTA'GCAAGGACGGACGAGCTTTrCTTC 

GG -TTGpCTCAATCCCTTTGAGTTTGCCCAAACAACGACTTACGAGCAGGCTCAAGGGCAGATTG 
l >: GGPAKJTCGTGGCTTATTTTGTCAGGGATTTAATGCtTCGAATCTG^ 



G A rTTTitACGGTrATTGGAGAAGATTTTGGCtT^ 

ga rrxA< xigtatgttgaagattactcttaaatcaaatAaccagttctacacttatatttccac^ 

TtJ iTGA' 'GTTGCtCTTCCACATCTTTGAGAATATCGGTGCTGTGACTGGACTACTt^ 

CC< :tJtG( ictttcatitcgcaagggggatcagctattat^^ 

AT( jAlbr ACC^GACTAAtCTAGCTGAAGAAAAGAGCGGAAAAGTCCCATTC/L^ACGGAAAAAGG 
7T^AAA( JAAATTAAACTCGAGCACCACCACCACCACCACf GA . 

2CJ*4$. "honiologue of SEQ. ID NO. 4Z n . 
ATOGGAjlAAATCATCGGAATCACfGGGGGAA^ 
r\ AG* AAG' SCTTTCAAGCAGTGGATGCCGACGCAGTCGTO^ 
^ TGAGGG" ^TAGTACAGCACTTTGGGCAAGAAATC^ 

GCTAGTC TCATGTrTTCAAATCCTGAAGAGCAA AaATGGTCTAATCAAATTCAAGGGGAGATTATCCGTO 
M AG( JAAC' IGGCTACTITGAGAGAACAGfrGGCTCAGACAGAAGAGATr^ 
'i TTI TGAG CAGGACTACAGCGATTGGTTTGCTGAGACTO 



£CFE47 



I 



TTI TGAG CAGGACTACAGCGAT 
G A; .CGC TAATGAAAAGGGACCAGTTGTCCAAAGATGAAGGTGAGTCTCGTCTGGCAGCCCAGTGGCCTT 
TAX AAA; AAAGAAaGATTTCGCCAGCCAGGTTCTIGATAAT^^ 
AA< TOO TATCCTTCTTGAGGQAGGTAGGCAAGATGACAGAGAtCTCXSAG^ 

oa-W i, / ...... 



Vhomologue of SEQ* ID NO. 43" 

ATGkGA/iAAAtTGTrATCAAT^ 
. GTG rCGT rGCCCTAATTCCAGCTATTATCTTGGCTG^ 
CGG AljG? AGCCAGTCTTGTGGAAATCATGGAATO 

GGA G ATT GACCCA AGAGGTGT$CA AAATATTCCAATGCCTrATdGTAAAATTAACAGTCTTCGTGCATCT 
V-JS TACTATt TTATGGGAGCCTCTTAGGGCGTTTTGGTGAAGCGACAGT^^ 
TGG rOCTl ^TCGGATTG ACTTACACCTTAAGGC«TTTG 
GAT ^ACMt>AA|GTTATCTGCTAAAGATACAGGACTTCA 
TGGpAGd^ACG^TTAATACGAtGATTGCTGCGGTTAAAGCAAATGGTC 
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O XJTG UVCCTQAQATTATTGATGTAGCTACTCTCTTGAATAATA 
"T^S A :1AA TATCAT CATTATTGATO^ 

^ r *GAA' K:TG<bAACATATATATCTITAGCTGCTGCAGTTGGTA AAGGA ATTCOTATA AATA ATGTrCTTTAC 
\S Oj VACA :CTGC5A AGGGTTTGTTGCTA AGTTGGAAGA A ATGGGAGTGAGAATGACTGfATCTGA AGACAGC 
^ A TTTI OTCG^GGAACAGTCTAATTTGAAAGCAA^^ 

(So GATf TGC^CAACCGCTTACCCCTCTTTTACTAAGAGCGAATGGTCGTGGTA 

^ TJ JCGA U\AACGtGTAA ATC ATGTTTTTGAACTAGCAA AG ATGGATGCGGATATTTCGAC AACAAATGGTC 
. ^ A.' ATT] TGTAlC^CGGGTGGACGTGATTTACGTGGGGCCAGTGrtAAAGCGACCGACTTAAG 
^ f< fCAG' VVGTCKTTGCTGGGCTTATGGCTGAAGGCAAAACTGAAATO 

G< iTTA' TCTCATATTATCGAA AAATTACGT AATTTAGGAGCGGATATTAGACTTGTTGAGGATCTCGAGC 
. ACCAQIACCACCACCACTGA . 1 . 

2CIE48 'horaologue of SEQ. ID 5 NO|. 44" 

ATC1IX?4Ai5AATTC 

GA/frGATAAAGTAGCATCTTATCATATCGATAT^^ 

ClIcCTT jGTT^ATTCAAGAAGTTCaaAAAATTAGTCACACACCTTTATC 

CGAACC TTtTOGGTAGATCA AGTTCTCGAITTACA ATGTGAGTATATTrGTATTCATGCTGAAGTTCTG^ 
TOGTCT rGCTtTTCGTTTGATtGAT AAAATTC 
A/ ACA,< ICTGTTTCTACAATCTTTCCCTACATTGATTTACTTC 
l -^V > GC TTTT jCAGGAGAACGCTTTTTGGAGTCTACCTTGTATAA AATCCAAGA ACTC * 
H QA AiGC TTATCAGTACATCATTGAGATGGATGGTTCTTCGAGTCGTAAGACTT^ 
GC AGG> .CCAG^TATTTATGTTATAGGTCGCAGTGGATrATTTGGTT^ 
GG GAT/ TCTOTT^CTAGLAGATTACGAAGAAATGACCGGAAAA 

^cc*cc:accactga j 

jLCFE49i "homologue of S3Q. ID JTO. . ^5" 

at 3aga|aatatggctttgaCagcaggtatcgt^ 

Ca fctTA ZAAA)AGCAGGAGCAGAGGCAGCAAAGTACCCAT^GCGACTATTGATCX^ 
TG jAAG TTCCAGATGA ACGCCTACA AAAACTA ACTGAAATGATAACTCCTAAAAAGACAGTTCCCACAA 
CA ITTGJ VATTTACAGATATTGCAGGGATTGTAAAAGGAGCTT . 
TC TQG< ;cAATATFCGTGAAGTAGATGCGATT^^ \ 
GG< -GAG MAGGACGTGAAGACGCCTTTOTAGATCC^Crc 
Tf< :rjPGC !TGAq{rTAGAAtCAGTGAACAAACGATATGGGCGTGTAGAAAAGATC 
AT> IAAG ^TC^GTAGCAGAATTCAATOT^ 

ct< :&tAi :CATf GAATTAACAGATGAGGAACAAAAGGTTGTCAAAGGTC 

M> AG 1 rrcri tatgtagctaatotggacgaggatgtggtticagaacctgactct^ 

H; AT! CtelX iAATTTjTGCAGCGACAGAAAATCCTGAAGT^ 

C?< iAAT: GGATCATGAAGATAAA AAAGAGT^^ . 
GT GAti( fCGTGCAGCTTA(XACTTGCTTGGATTGGOAACTTACCT 
G<T TGG> CTTTCAAACGTCiGTATGAAGGCTCCTCA^ 
G<p- TTATJTCGTteCAGTAACCATGTCATATO^ 

AG< JTGGACGClnrGCGTGAAGAAGGAAAAGAATATATCGTTCAAGATGGGGATATCA 
AAVGTCdTCGAGCACCACCACCAOCACCACTGA 

• 7jCF]E50- "homologue of SEQ. NO. 46" . 
ATC G^ATCGAaAAAACC^TC^ 
^ A A/ T^A^ lTTATJATCGAGCTGTACTACGCT^ 

% cac tcgt caggctgtctatgAcaatatcaagcga acagaaaagattctggaagXttatoagatgaaatt 
£\ gca catc tactipggactatattgtccgcagtcagatttt^ 

Vt ACT ITCT SCAGGAGCAGATAGAAATnTAACAAdCATTG ATAATAGAGAACTCGAGCACCACCACCACC 
' ACCACTGA J: 

2XF,fc5li' J"tiorhologue of 'SEQ. ID .NO. . 47" 
^ . ATG A.QGT|rAGAATGGGAAGAATI^ 

CX GTGSTATrCGTAAGCAATATCGTAAGCAAAATA^ . 
j^GCC VATT SAGAGCATCAAAGA^AAAAt GGCTCGTCXiTGGCATTACTTATGCG AOT1TGGAA 



o 
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\ ti iGGA rATTGCTGQCTTACGTGTGATOGTTGAGTTTGTAGATGACGTCAAGGAAGTAG^ ^GQATATTTTOG 
Vi> A< l AAG :GTCAGGATATGCGAAl€AtACAGGAGCGAGATTACATTACTCATAGAAAAGC^TX^GGCTATC 
«vl G' TCC1 ATCATGTGGTAGTAGAATATACGGTTGATACCATCAATGGAGCrAAGACTATTtTGGCAGAAAT 

^ Tt AAA' :tcgtactttggccatgaatttctggggaacg 

Gi tftU CXJAGATGAGAtrAAeAAGCGACtGGAAATrAeAGCTAGAATCGGCCAt CAGTTGGATGAAGAA 
Wt V GGG' "GAAATTCGTGATGATATCCAAGAAGCGCAGGCACTTTTTGATCCTTTOAGTAGAAAATTAAATG 
a< mi < }TAG&AAACAGTGAOGATACAGATGAAGaATACAGGCTCGA 

2 CI ES2 / "homologue. of SEQ. ID NO. 48 1 ' 

at ggaActtaatacacacaatgctgaaatcttgctcagtgcagctaataagtcccac^^ 
li CT&l ; cagagattgccctagc^gggcgttcaaatgttggtaa^ 

CC GTA7 GAATCTCGCCCGTACATCAGGAAAACCTC^ 
^ q/ CAA< »ATG<X3CTITOTGGATG?IGCC7GGTrATOGCTATG 
^ G<GGG'GCATGATTOAGGAGTaCTTAACGACTC^ 

tc gIca TCaCCCGTCAGCAGATGATGTCCAGATOTACGAATTTCTC 
\'^S An rG p GQ£X3A lcCAAGGCGGACAAGATTCCTCOTGGTAAATGGAACAAGCATGAATCAG 
^ A/ ATTA AACTTTGACCCAAGTG ACG ATTTCATCCTCTTTTCATCTGTC 

CT IGGG ATGCAATCTTAGAAAAATrGGCGGCCGCACTCGAG 

4 01^53 "homologue of SEQ.. ID NO, 49" ' ^ . " • _ 

v i^AAACAAGAAAAATCCCTTro J A wi# 

^ TP :gca ITGTC1AAGAACAAGGAAGGACAAGTCTTTATTGATCCTACGGGCA 
tss^ <5 r/^TA ICAAACTAGACAATGCAGAAGCCXTAGA 

\^GC *TG$ AAGT^GAAGAAAGCTTTTATGACGAGTTGATCGCTTATGTGG ATCACAAAGTG aa aaga AGAO 
AGrTGCiAOT . 

<2CJTE54 "homologue of SEQ, ID^NO. 50" 
l Atp }TTAAA ACCGTCTATTGATACGTTGCTCGACAAGGTTCCTTCA AA 

a wUcc tgcccacgaattggMgcaggtgccccagcaa 
^ cGcocr tagaagaaatxdgaatcaggaaacgttacaattcacccagatccagaag 
up* CT< SCGTi ^gccgtatcgaagaagaaaaacgccgcaaAgaagaagaagaaaagaaaatcaaagagcaaat 
^ TG< TAAj igaaaaagaagatggtgaaaaaattctcgagcaccaocagcaccaccactga 

£CFK5$< "homologue of SEQ, ID. NO. 51" 
so A T$TCAtTAAeATCAjUACAACGTGCCTTCC^^ 
I s - T<k rGGAAAAATGGACTCAACQACCAAATCAAAACCAGCGTC 

>^ T& AGQTTACTCTCTTACAA A ACACAGATGAA AACATCCACGAAGTAGCTG A AATTTTOGAAC A AG AAA . 
\2> TCC tGTQfGdATACAGTCGAAAAAATAGGACGCATCTTGATr^ 
. TCC CAAGAttfCTAAGAAAGTCAAAGAA 

ZCFI56 '"homologue of SEQ. ID, NO* . .5,2" ' > ..\ * 

Alt CkSGATTGAAAATTATATACCAGATITrGC 
. AGC^GdAGGG/^TCAAGCKJTGTTTTGGTCGATTTGGATAATAC^ 
K* AAC :GCC> l GAGATGAAGCAATGGCTACATGaCCTTCGGGACGCGGGTAJTC 

AAC ACC/ AAAAACGCGTTCAACGAGCAGTTGAGAAATTTGGGATTG 
^ COT FCAC VTTTiBGTAITGACGGTGCTATGA AGGA ATO 

l £ CQA CCAC CTCATGACAGATATACGAGCAGCCCACCGTGCAGdGATTCGGTCAATT^ 
GTC CAAC ATGACTCAATCAAAACGCAGATtAACCGAACTC^ 
A Y AGTACC*/|CCGATTA(^TATAAAAA 

^ OatfE?? "homologue of SEQ. ID NO. 53" 

N ATfS rTTCpAAAAATTTTAATXtjGCAATCGTGGTCAAATTGTO 

GGG G ATI GCGyJcGGTAGCGGTTTATTCAACT 
V$V GCA 3TTT jTATraGTCCTGGCAAGGCAACAGAGf CTTATCTCAATATTAATGCAGlTCTATCAGCTGCAGT 
-W> CTT^ACtpAGG^ 
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GT G^lAC i aagtaggtatcaagtttatcqgtccatctggtcatqttatc 

Tccqcc TGCTCAGATGATTAAAGCAGGTGTGCCTGTTA^ 

QA AQAj LGCTWGATTGTTGCTGAAAAAATTGGCTATCCTGTTATG 

01 AuAAOGGaTTCGTAAGGTTGAaAAACCAGATGACCTCGTTTCT 

CAA^GCAaTTATGGCAATGGTGCCATGTAC^ 

CA AATtfeTAGQTGATGAGCAtGGACATGTGATTCACTTGGGTGAACGGGATTO 

AC CAAA AGGTTTTGAA AGA AAGTCCCTCGATTGCAATCGGAAAAACGCTGCGTCATGAAATA6GTGCTO 

CT 3CTG rTCG^GCGGCAGAGTTTGTTGGCTATGAGAATGCAGGAACCATTGAAT^ * 

AA GTAC AAAT| jTTCTATTTCATGGAGATGA ATACT03TCTTX^GGTAGA AC^TCCAGTAACAGAGTTTCTrT 

TC 1GGT 3TTG#TAXCGTTAAGGAACAGATTTG^ 

AT fcTTG rCCTACGCGGTCATGCCATCGAGTGTCpT^^ 

AG rCCApGTAiAGATTACTAATCTCTATCTGCCAAGTC 

AT ZG^GbTTAtACCATTCCGCCTTATTATG AT AGTATGATTGCCAAAATCATAGTACACGGCGAAAATCO 
TjT fTGA -GCCITGATGAAAATGCAACGTGCCCTCTATGAATTAGAGATTGAAGGAGTC 

ga rrirc 2AGCtigatctcatttcagatcgcaatgtcattgctgg.gga 

a^cctt ottacctaaatatcaagaaaaagaactcgAgcaccaccaccaccaccact 



^ 2CF5S»i "homologue of SEQ. ID NO. 54" 
Cv AT ^T^ACAA^GTTTTre 

jv. AP ITAG VCATCGATGCCAGCTX^GAACTTGAGGGCCGTATCACTGCTCGCCAACTTGTC 
fN> CC( ^tj3A 3TACAATATCOAGTATATCGA ACTCrTGTCmACAAATTGCTCGATTACGAAAAAGAAACTGTC 
^ GP jTTC( jAAAl^ACGGAGTTCCn^GAGCACCACCACCACCACCACTGA 

■ *■ 

2 CF]2$$(i ^homologue of SEQ. ID NO* 55" 

AT< SMGDATAOATATATTITAGCATTTGAG ACATCCTGTGATGAGACCAGTGTCGCCGTCTTGAAAAACG 
Ati jATG \GCTCTTGTCCAATGTCATTGCTAGTCAAAT^^ 

LApTAGCC^GTCGTC^ 
GA' rmC< :GA AGAGGACQTGACAGCTGTrcCGGrrACCT^ 
GG' TTG1 CAGCTGCCAAGGCCfTTGCTTGGGC^^ 

GC LCCT< lATGGCAGCTCAGAGTGTGGAGCCTTrGGAGTITCCCITGC^^ 
O AG LCAG VGTT&GTCTaTGTTTCTGAGGCTGGCGATTaCAAGA 
^ TGC rGG A K3CTTATGACAAGGTCGGTCGTGTCATGGGCTTGACCTATCCTC 
^ CTOGCTCATCAGGCGCAGGATATTTATCA^ 

. rp ce: tctcaggotgaaatctgccttta 

^ TA( '-ACAu iGATinrGTGTGCTTCCJTCCAAGGAGCAGTTATGGACATTC 
TIC GAGJ AATATCCTGTTAAAACX^CTAGT^ 
G|JC TAGC lAACtG AA ATC^CAGATGICAATGTTATGATTCC 
TA1 GAT1 GCTTATGCCAGTGTCAGCGAGTC^ 
. CCA AQTk iTTGCCTTTGATACCATGGA ACTCG AGCACCACCACCACGACCACTGA 

: * I" (.-.■.* 

^jcBBfiO 1 ^homologue of SEQ. ID^NO. 56" 

ATC ^TdGAAT^GnGGTGTTGTTGGA AACACAA ATGCAACTOATATITroATTCAAGGGCTTGAAAAGC 
TTGAAT^CCGTGC^ATGATTCTGCGGGAAlTITr 

GTT GGTC OTAT^CAOaATTGTCTGCCAAGACAGCTGGTGTTGAGGGAACAACTC 

GTT GQ Gt AAClipAlGGGA AACCAACGGAAGACAATGCTCACCCACACCGC 

TCT TGbT rCACAATGGGOtGATtGAAAACTACCTTGAAATT 

CA!A AGCK fCA A ACAGATACGGAAATCCTCCOTACATTT^ 
55 Aqi TCTT 3A AGpqtlTA AAA AAGCTCITCATAT^ 

AAA ATCC AGATtoTCATCTATGTAGCGAAAAACAAATCTCCACTTTTGA 
^ TATCGTC rGCTGAGAtGCTAlGGCTATGATTCGTGAAACGAACCAA^ 
Vi; TTGGT^A rCGTpAAGGCTGATAGCGTGGAAGTTCAAGACTAT^ 

ATA CTpC jGAACTTGACTTGTX^GATAtCGGTAAGGGAACTTATCCT 

tg a 3caa ccaactgttatgcgtaaactcattcaagcctac^ 
. cct scta :cat^ggctgttcaagac<k:agaccgcatct 

GAG 3ATT rcCrtCTAAGAAAATGTTC 

GTG 3GGC rACGpTATGCCACTTCTCAGCAAGAAACCACTC 



42/60 



WO 01/49721 



PCT/USOO/35604 



I 



'4 



rdgaragtcgtcaaottttggtcaaggctaatgaaatgggaattccaag 
caggt rcaaccctctcacgtgaagccaactataccatc 
caa :taaAgcctatacaocgcaaatc 

GfcTAA rGCTMAGCGCA AGCCmGACCTGGTTC 

ctc^tcagXgaaagaaacc^ttgaagcca 
cAtcggacgtggtcaagattactacgtagccatggaagcaagtc 

d iGTG' :g aaggttttgcggcaggagaactca agcacggaaccattgccttgattgaagaagg aacacct 

6' XfTTi GCTdTC^GTCAGAtCCAGTCCTTGCCAACCACACTCGTGGAAATATCCAAGAGGTO 
l£ GXjGTGCTA^GGTCCTCACTATC^ 

K TACJ .CCCTTACCTXnXJACCAATTTCAATGGTCGTACCGACG 
A< :CGT( iGCCTCGATGTGGATAAACCACGTAACCTTGCCAAGTC 
ACGACCAGeACTGA 



I 



2Ctf£$t "homologue of SEQ. ID., f NO. 57 
ATGATiCGTATCGAAAATCTCACT 

TC CA r TC GACCAACAATTACCGGCATCATTGGTCCA AACGGCGCTGGGAAATCAACACTATTAAAAGGTAT 
QC TGO< rAATTjATCCC^CATCAAGCTC^ 

Al r<?cc tatgtcgaacaaaaaatcaatatcgactacaactttcccatca^ 

TA GGAC TATrtCCCTCTAtTCCTCTCTTT^ 

OC TTG A AATCGTCGGCCTAGCTGACTACGCTGAACGTCAAATTAGTC 
cdGGTtTTGATTGCCAGATOTTC^ 



T>o 

C^CGTtTCACCACGACCTCAGCAAGATTCCCCACT^ 
AT TGCC rTTGGTCCAACAAAAGAAACTTTTACCGAAACCAA 
TtT tTCA ITGGAGGTGACCTACTCGAGCACCACCACCACCACCAGTGA 

i • T r • 

,Z€F162 ( "homologue of SEQ. ID NO. j58 u 

AflCCCkAAaAAGTGAATTT 
OA rGTT 2ATTTTGTCCATAAGGCCTTGGTCTATGCTO 
C(S \GCC rTATATCATTCACC^TAT^ 

GT IGCT rGTG0AlTCTTGCATCATGTGGTGGAAGATACAGATGCGA 
TT< iGTC< XGATGTGCGGATGATTOTTGACGGAGTT^ 

Gp \GC A ATTAGCGG AAA ATCATCGCAAG ATGCTCATGGCCATGTCTGAGGACATCCGCGTTATTfrGGTC 
AA \CTC rCTGACC^rCTTGGACAATATGCGG ACCCTGAA ACATCITCGAAAAGACAAGCAGGAGCGTATTr 
Ctj i AAG lAACCATGGAAXTCTATGCCCCGCrc 

A(j] VAGA ctrotCTTTCCGTTATCTCAATCCA ACGGAGTTTrAGAAGATTACCCATATGATGAAGGAAAAG 
CQ< :aGG 1 AGCGt GAGGCCTTGGTG6ATGAGGTAGTCAC 
TTC fAA A< JGG A AGATTTATGGTCGTCCCAAGCATATTTACTCAATTTTCCd^ 
AA< jtfOT TGAGGAAATCTATGATCTGATTGCTATTCGTTGTAtTTTAGATAG^ 
. ATCiCTTK GTTACGTGCATGAATTTTOGAAACCGATGCCAGGTCGCTTCAAAGACTATATC 
a6< iCCA. ttGGTTATCAGTCTATCEATACGACTC . 
AA< :CAA< KjAAATGCACGAGGTGGCTOAGTACGGGGTWCGGCTC 

G<5< rGCA igttaagagcaaggaatcagctattcgaatgaactggAtc^ 
CC/ GGC' gatgatgctaaggaatttgtggactctgttaag^ 

ACCC CAGATGGAGCTGTCGGTTCCCra 
*> tac; CAa< GTCGGTSAA AA AGCAACTOGTGCCAAGG^ 

A*J .GAG LGGGGaTCAGGTTGAA ATTATCOta " 
. ATC GTGA AGACTAGCA AGGCGCGCAATA AGATTCGCCAGTTCTTTAA AAACCAAGATA AGGaATTGTCT 
GTC AAC* AGGGfrcGTGAGATGCTGATGGCTCAGTTCCAAGA AAATGGCTATGTGGC^^TA AATTTATGO 
ACAA^ CCAdKTGGATC 

tgg rrrr j6ggaaatcggtgcgattaccgtotttaaccgtctga 

CGt 3CCA AGGC&MGGCTGAGGCAGAGGAGOT 

A AC TCTC UGGf CAAGCATOAGGGGGGAGTGGTTATTOAAGGTGCTTCTC 
A AG TGTT 3TA ApCCCGTGCCTGGTOACGATATTGTTGGCTACATTACCA^ 
ACQ 3TGT 3GACFGTATGAACCTGGGTGCCCAAGAAAACTACGAGCAACGTC 

AOX CCAG TACTCTAGCTCAAAt AAGG AGTATATX}GCCCATATC»ATATCTACGGTCtGAACCGTACAa6A 



SO 
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c2, drqTTbAACpATOTACTGCAAGTTCTrrCAAATACAAC^ 
T T C CAAC GATATX3AAGTtTGCTAATATCCATGTGTCCTTCGG 
V. ^ \^ cfrcGArAAAATTAAGAGTGTGCCAGAAGTT^^ 



it 



2c3FE64j "hoinolpgue of SEQ. •ID NO. 60 
Atq AG AGAAOAAATCA AAAATCTGCAGGCAC^^ 

AGAjGCTOTTCGTATGCGTCCAGGGATGTAC 
TCTGC* JAAATTGrTGATAACT^^ 
G AGCCAGATuArrCGATTACTGTTGTGGATO 

G rCGT( :ctgctgttg AGACCGTCTTTACAGTCCTTCACGCTGGAGGAA AGTTCGGCGGTGGf GG ATaCAA 
G 3TITK !AGGTGOTCTTCACGGGGTGGGGTCGTCAGTTGTTAATGCCCTTT^ 
T< :CATi AAAACGGTAAGATTCATTACCAAGAATACCGTCGTGGTCATGTTGTC 

T< iGAQ ^tacggataaaacaggaacaactgttcacttcacaccggacccaaaaatcttcactgaaacaac 

Ai ITUIT TGATTnTTGATAAATTjiAATAAACGGATTCAAGAGTTGGrc 

C* 'AT(A ctgataagggccaaggtttggaacaaaccaacksattatcaitat^ 

C< >TtC/> ATATATCAACG AGAACAAGGATGTAATCTTTGATACACCAATCrATACAGACGGTGAGAf GGAT 
Qj kTATi( :ACAGTTGAGGTAGC(^TGCAATaCACAACX}GGTTACCATG^AATO 
A' VitTC ATACACA1X3AAGGTGGAACGCATGA ACAAGGTTTCCGTACAGCCrrGACACGTGTTATC 
Tl ATGii TCGTA AGAATAAGTTACTGAAAGAC AATC5AAGACAATCTA ACAGGGGAAGATGTTCGCGaAGO 

ti TAA< tgcagttatctcagttaa ac^cccaaatccacagtttgaaggacaaacgaagaccaa 
A> TAGk igaagtggtcaagattacca atcgcctcttcagtgaagccttctccg^ 
>c c/ ca<3j /ttgccaa acgtatcgtaga aaaaggaattttc 
. gc gtg!^ agtcacacgtaaaaaatctggtttggaaatttccaac 
tc f aa't aaccctgctgaaacagaactc^ 

CI CGT& ACCGTGAGTrrCAGGCTATCCTTCCAATXC 

GG ATA^ G ATTCT AGCTAACGA AGA A ATTCGTAGTCTT^ 

TT rGAT jTTTGGaAAGCCCGTTACCAAAAACTCGTTTTC 

tr :gta sccttcttttaaccttgatttatc 

CC :AAf CACCAATCf ATGGTGTCAAGGTlrqGAAGCGAGAT^^ 
A'A Q aAATCAAACTCCAAG AAGCTTTAGCeCGTTATAGTGAAGGTC 
TA \GGgGCTAGGTCAAATGGACGATCATCA<TCTO 

CG CTTAGAGTTTCTOTAGATGATGCTGCAGAAGCAGATAAAATGTT^ . 
GA 3CCt DGrCGTGAGTTTATCGAAGAAAATGCTGTCTATAGT^ 
ACCACCACTGA 



2 CF£65 < ''hdmologue of SEQ ID. NQ* , 61 " 
^ ATOGGaITITACTGAAGAAACAGTAC^ 



TGACAQMXrTTTATC^ 

GA1 it(3G igaccctgcctacgtttctcgttataata atgcacgaaatcaaatccgtaaGtatgagcgtgat 

H> QAj iAtC }TtQ AGOAATTOGTTCCKiTAGTATCTCAAAGa ACAAGGAGTGGATCTACTCG AGCACCACX3ACC 

acoaccJj ^cta^ . - . 

£ CJTOtftf | 'Ihbmologue of SEQ.. ID NCU..&2" 

ATC GtCAACTATCCACATA AA6TTTCAT CACAAAAAAGACA A ACATCTCTTTCTCAACCCAAaA ATTTOT 
. CAi« AJTdC lAGGAATGTCTTTtGAAAAGATGATCAATGCT 
TGI TATA CATAACAAACCAACTCCTATTCAAAT^^ 
^ A^GTTG^GCCTAtThrCGACAAGCnrrCAACGACG 

. CTT rGAA GTCAAGGAA ACA A A ACAA AA ACGTGCGATTCCGAT GAAAA ATTTTCATCCACaTCAGATTCAG 

.X. CA*1 ATJGC aacaagtccttgccgaacaaggaatctgctttgtc 

V£" CTA CTTA rTGCCGGCATTCkjATITGATTCGCTTCTATC 

AAT ATAt TCG AfflAATATGGATATGAAATCAAGGCTXKjTGCCTrc 
AAA G.AAf ATT1)ATTaG(57W 

• • | * • * * 

2CPE57 /"homologue pf SEQ. ID .NQ., 63" 

I II • 1 • ' ■ • . ■ ' - • ' 
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zThGCtCTAmAGTAAAAAAGATAAGTATATTTC^ 

>a3CtaWcagaggttccagatgaattattttcc(^gtctc.caggctgtmg^^ 

GG\lTCTGGGAAGTGAACGTATCTGTCCGCACTGTAGCTATACCTrTCGTATTTC 

CT Tl!iA ; < JGATTGATATGGGAACCTTCAAAG AATTGmACAGGGAtrGAAAGCAAGGATCCGTTGCATTX 
CC< TG<j "TACCAAA AGAAACTGGCATCTATGCGTGAAA A AACAGGTCTGCATG AAGCCGTTGTGACAGQ. 
AA "TCC rCTTATTAAAGGTCAGACTGTGGCTCTTGGGATO 

o gtacgIiTGtIgGTGAAAAAATG^^ 

AT 'CAC ^OCCTCTGGTGGAGCCCGTATGCAGGAAGGAATCATGAGTCTCATGCAGATGGCTAAGATCTCT 

V*^' GC ^G3TTAAACGCCATTCA>ATGCTGGfCrCTTTTACCTGACCATOT 
l ^ GPSTCA^GCTTCTTTCGCTATGGAAGGCGAtA^ 

GG 3CGT ^GTGTGATTGAAAATACGGTrCGTGAAAGCTTGCCTGA<3GATTTCCAAAAGGC^ 

TA( 5AAC iTG^CTTTGTGGATGCTATTGTCAAAAGAAGAGACTTACCAGATACGATTGCTAGCCTAGTCAG 

Ar X3CA :ggagggagtcctagactcgagcacIcaccaccaccaccactga 

I 1 - i 

* ' il ; . * ' * i , . , 
fit 16$ t "homaladUB of SEQ.* ID NO!. 64" 

atomjAJvttaSgggatpggacgtcggtt^ 

tx> CA<fcCTCiU*GG&TTC^ 

^ AATTGG' TGA^CTTACAAGGTGGAACGATTTGTAGTGGGCTTGCCTAAA AaCATGAACAATACAAGTGG 

^ ac( :gcg< ;gtagaagctagtcaagcctacggagcaaagctagaa^^ 
v>n ck< !gat< } aacgcttgacaacagtggctgcrgagggcatgttgattc 

^ aagcgc vagaaagtcattgataagttagcaqctcagctgattttacaaaatta . 
% arc 169 

ATC rACAAAACTTACTGTTAAAGACGTTOAC'^ iUAAAUVj i maaaaui uti k*\j m-uiuxi wm- i i w\Myv 
TA< CA'H OA AAGATCGCGTAATCACTA ACGATA ACCGTATCACAGCAGCTCTTCCAACTATTAAGTaCAT 
CA1 XXJAV ^CAAGGTGGACGTGCaATTCTT^ 

GG' AAA' CACTTGCTCCTGTaGCAGCTGACTTGGCAGCAAAACTTGGTCAA 
CA< TCGjl GGTGCTGAATTGGAAGOGGCAATCAACGCTCTTGAAGATC 

ac cgtj jCcgaAgatgttgacggcaagaaagaatctaAaaacgatc 

TO CTTC GAGATGGTATCTT€GTAAACGATGCATtCGGTACAGCTCACC6TGCACACGCATCTAACG 
GW TGTC AGCAAACGTTGAAAAAGCAGTTGCTGGTTTCCTrcTTGAAA 
AGC IAbT' GAAAGTCCAGAACGTCCATTCGT'GGCtATCCTTGGTGGTTCAAA 

GTI ATCC AAAACTTGCTTGAAAAAGCTGaTAA AGTC^TTATCGGTGGTGGGATGACmc^CATTCTACA 

^ iUC fCAC^AOOTATOGAAATOGGT^ 
\fr TOA AAA; k GCA AATGGtAAATTGATCTTGCCAGTTGACTG^ 

^ GA/ gtg<Egtgacactg aaggtgaagcagtttctg aaggcttccttggtcttgacatcqgtccAaaatcta 

TCG CCAAATTTOACGAAGCTTTGACTGGTGCCAAAACAGTTGTATGGAACGGAG 

A Afi CCti GATTTCCA AGCTGGTACAATCGGTGTG ATGG ACGCTATCGTGAA ACAACCAGGAGTTA AATC 
A AT CATC GGTGGTGGTGACTCAGCTGCCGCAGCGATTAACCTTO 
AG7 AGGC GTGGTCGAGCAT CA AT GG AACITCTTC AA 
AAA A Ad' tCGAGCACCACCACCACCACCaCTGA 

™ 1 4 • * - 

• * I * ,» • * • 

2.CPE70 1 . "homolosue of S.EQ. ID NO. 66" - • . ^ 

A TC 1TAa|aaTCAGaAaAACAATCACGTTATCAAATGTTAAAIGAA^ 

AAA CCA/ TGTTTTGGCTAATCTTTGCAACGGCAGTGCTC^ 
GCA GGC1 mATTTCTTCGATGGAA AGGAATTGGTTrTA^ 
TAT nGCA DTAGGCAAGGGtGTTTGTGGTGAG^AG^ 
ACG AGCT \TCTGA ACTATATTTCrTGT^^ 
Ct ATG ST CA GTTAfeTTGGAGTrcTCGATCTGGATTCTrCAGAGATrGAGGAI^^ 
TTATITG©AAC$ATTrGTC^ 

. TCjT' :tcg|gcaccaccaccaccaccactga 

^ &CFJS71- ,, hoiirioJ..o.gjie.of. 5?Q.. ID NO. 67" 
^ ATGACAA|TCX3AACTATICACTCC 

AAA rAAGTTGGGATTTrAGGGGGGAATtTTAACCCTGTTCACAATGCCCATCTC 

- -.- ■■ V : ■ - • . ••• 
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Tt T/lCGG<tAACAGTTGGGACTGGA7XAAGTTCTTCTCATO 

6( /AAA' ^CATCCCTGAACACCATCGTCTCAAGAiXJCTTGAGTTGGCAATTGAGGGGATTGACQGCCTAGTC 
A.1TGA/ ACCATTGAGTTGGAGCGCAAGGGTATTTCCTAC^^ 
<^ AC iAAtf JCAGATACGGATTATTACTTTATCATCGGTGCCXJACATGGT^^ 

Ij ^q/xga^tgGTTGACATGGTTC^GTTTGTGGGGGTTCAGCGTCCACGC^^ 

\£ TA TCCA gttatctgggtggacgtaccgctcatggatatctcgtccagcatggtgcgggacttcgt^^ 

AC GXCCIGAAACCCMACnTrcTCCTACXJTC^ 

G/ GCAt CACCjACCACCACCACJGA ,' , -' ,, 

acIStfa "homoiogue of SEQ.' ID NO. 68" ' 

Al GAAiATTGCAAAAATAGTCAGAGAAGCGCGT 

CC CAT! rn-GATGAATTTATCCAATTACATGGTGACCGTTCTTTTCGTC 

1A TTG<& rTGGfoTTGGAGACCAAGCTGTAACAOT^ 

Od rCAA\CGGAArrnX3GCCAACX:ACATCCAGAAGGCTACC^ 

Tfi kOAJ ATTtQGCCGTCCAGTTGTGACCTTTATCAATACAGCAGGTGCTTATCCTCGTGTC 
^ OA ACG1 GGTCAAGGGGAAGCTATCGCTCGCAATCTCATGGAAATGAGTGACCTGAAAGTrCCTATTATCO 
CC \TTA n"ATCGGTGAAGGTOGTTCAGGCGGGOCTCTCGCTCTAGCTGTC 
£k Ob AAAATTCTATCTATGCCATTCTCAGTC 

CC \TGG A^GCAGCAGAACTG ATG AA A ATCACTTCGC . 
TG \TTT< 2TGAAGTAGGACTTTCTAGTAAAGAACTAATTAAGAGTGTCAAAAAA 

GG CTA$3 ACTITCACAAAAACCGCTAGAAGAGTTGCTGGA AGAACGCTATCAACGATTTAGAAAATACCT 

cg \qc^ cgacOacgaccaccactga m 

SL CF £75 "homoiogue of.. SEQ. ID .NO. ?J" 
AT 3TCAtrATAAGATTG^ 
AA 2GGG CGAGCAGACTT^ 

i£ cc- Vltc< jaaaXtcgtaaacgggggCtagaaaaggctt^ 

TC f 'CATC iatgaattggtggtcgatgttgcaaaaagattgggtcc 
at<5Cgt<:ggatttgcAatatcaagccagttttgattactac^ 

^ AT TAT1 TACATAGTCGACCTGAACATCTXTrAfATCAGTTCATC 

S; .GGATAritSCCTCCTATGTTCCCGAGAGTATTTGGAGGAAAGCG 
CCACTOA ' . 

i CFi:^ . "homoiogue. of SEQ. It> t ]NO. 72" . 

atgacgAtttcgtitgtggttat^^ 

aqj lGctAaaagtgaaggagaaaaattagct 
^ tta tac/ atogcttcgaatcttattacagcgttcttggtcgtaataaacag 
<sv ga' aaagato accataagattta^ . 

a a< :ggi7 tctaaggaaaagggagctggcgagattgacaagataaccxttggtcgttatcaagacaag^ 
^ M' ^ctgc igtagttaagtcaggatctgattttt 

GA<iGGC5 TACfCGAGCACCACCACCACCACCACTGA 

• » • 

9CFI78 "homologua-Jaf .SEQ. ID NO. 74" 
ATC TOT^ATCGATAAAGAAAAATT^ 

CGC C^G( ATATpCTITACTGGAAATCAGTGTTrAA ACAATTTATGAAGAA^AAtCAACTOTAGTCATCfTT 
GGC AJM-iTTCGTACKJCATCATTTT^ 

TCA GCAA GGTAAAOGACnTAGTGTTCGTrATATCA AGCCAA ATGCGGAGCATTGGTTOGGTACTGACAO 
O TAA CGG1 AAATiCGCTCITTGACGGtGTCTGGTTCGGAaCTCGTAACtcC 
v CAC TGA1 TAAOTGGTTATCGGTGTriTTGTra 
v ATG A.TCC AAGtHtTACAACGTCATCTCAAACATCCCACCTCTT^ 

CGjGlAGCTlOGAT/rcTGGAATCTGA 

GTl 




gm GXn ottoccAaaaatatcatocctgaattckjtatctgttattgtgacaaccatgactcaaatg 

CAA QCTT fATCTCATACGAAGCCTTCTTGTCTTTCTTCGGTCTTGGAtTAGCGATTACA^ 
GGTpTtfGATjTCXGATTATTCACAMACGTAACAACCAATGCTT^^ 
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' BSCS' TGTCTTGGTATCCTTGTC 

^ ATd ^GA.(pjrcGA jjCACCACCACCAOCACCACTOA . ' • , , 

^ 2 Cflffl "homologjie of SEQ. ID NO. 75" 
AJ( rTATj LACCTATTATTAACCATllrTATTAGTATTATCT 

J^, AC( :aa A vaccaatccagcaatgtatttgatgccagttcaggtgat^gtttgaa 
l£ or t£g> agctgtaatgcagcgtttgacaggg attttagtctttttctggctagccatt^ 
ac( jgt4 ttatcaagtagactcgagcacgaccaccaccaccactga 

fcCFlW . "hotfiologue of SEQ. ID NO. 76" 
ATC 1TTTC GTAGAAATAaATTATTTTTTTGGaCCACAGAAATTT^ 

AG, iCAG \tggGatctttgattaacccttttgt^ 

Gfiocrt CTTTATTATTTGAC^ 

TG< rTATfj TTAA^ACCTTGTGTACTTTGGTC 

GA' TAA' CAGTTATCTAGmOATTATATCTAGTCAAACTATTTATAGTCGAGTACA AGACTTAATCATAO 
AC j T£TC TAAT^ATCCTGCGCTCCAGAATIT^ 

N. GTi GJjLlV .TTGTTCAAAATATCCTA AAtAGCGTATCAAATAGTGTGGGGAGCGTCTrGTCAGCTCTTATCAG 

TA( TGT71 TTGAjnTTOATTATGACTCCAGTrTTt^ 

ATX K2TTC AAA<^AACGATTCTAAAGAGGGATCGCTTGCATATTC 
l> CSi lTTG( 'TCGGTATAT^ 

^ AG' ATT/ ttggtttaaaatAtgctttagttttt^ 

. GG( 5CCAj tGTATTGGTTTGATTCCt ATGATCATCGCAAATATATTCAGTGATC^CCATAGACTGCTGArroC 
AG' GA# TATATCCTTGTTGTTCAGCAGGTAGATGGGAATATC 
TC> AbO' [TCATCCAATCACGAT1TTAGTTTTACTTTTGTTGTGA 
AH GTGC CAGTGCCAACCAATtCTATCTTGAAAGAAATTtCTAAiGTTCTTA 

: ta; A ATj ATG^AAGAACGAGAAAGAGAATTAGCTAAGCTCGAGCACCACCACCACCACCACTG^ 

■ ■ • * * * • 

2 CM 81 "homolo.gue of S,EQ. ID. NO. 77" . 
AT drATtf M WACTTTATCGAAAATATAGAA 

ClAAGA^TXrnjAAACAAGCGGTGGAGCAAGAGAAAATAAGTCACGCTTATCT^^ 
A4C!GGGAaAAACCAGTCTTGCTAAAATCTT^ 

ACC TJjQdAATAACTGCTATATiTGTCAAGC^ 

GC> G<rn CTA ATAAtGGGGTAGATGAMtlXXiCGAAATTCGTGAT^ 
CTC GTTA rAAGGTTTATATCATAGAltjAGGTTCACATGCTGTCTACA 
AeC CXGjJ tAAGAACCAACACAGAATGTAGTCTTTATTTTGGCCACTACTGAATrQCACA^ 
£V TTCt ^TCCCGTGTGCAACGTTTTGAGTTTAAATCAAXTAAGACACaGGATATTAAGGAA 
TAT ATCT TAGA AAAAGAAAATATCAGTTCTGAACCAGAGGCTGTGGAAATCATTGCCAGACGGGCGGAa' 
GOTGGAi XGCGGGACGGCTTGTCTATTTTGGLATCAAGCCCTCAG 

CTC CTAT CTCTGAAGAAATTACTGGCACCATTAGCCTACCAGCCTTGGAtGATTATGTGGCGGCCTTGTCT 



A CAA CAG< ATGT.TCCCAAAGCrtTGTCTTCCTfGAATCTTCTTrrTC 
kfc TOT 3ACC GATCTTTtGCACTAmAAGAGACTKiTTAATro 




teGAACCAGCTCtATt^GGAGCGGTTtSAAAATGAAATTGCTACGCTG 
GOdCGTC ICMACAAQAGCTTTCTA ATGTAGGTGCGGTTCXTA 
CAtiCTACGGGCAAAACAGTCTATCGTGTCGATCGC^ 
$££ AmA GCACGTCAAAATTTAATTCGtT^^ 

GGTGGGj^CGGAjCAAGCTCTGCXJTCGAGGACCACCACCACCACCACTGA ' 

fi^CPEW^ "honipiogue of SEQ.. ID NO. 78"" * 

A 3E 5S!Pfc!J r ^ A¥3CA AtA AGTTAGCGGTAT CGAACnTOATTAAAAACOGCAAACTCTACTATXIGTTTraC 
??J ^ ^ C ^TTGGCAGTCACTGTCACCTATCTCTTTTACTCTCr^ 

^ • AA J rS° T J ®AG6AACAACCATTCAGGCTAGACTTGGATTTGGTATGTTrGTCGTCAi^ 

> S?S SP 3 ^AGAAGCGTCATCTTATCAGTATtjACCTTTAAGGAGTTAGTGGTATI^ 
GAG CGGC TATC^GTATTGGAGCCTTGTTTGACAAOT 
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J2 CFc j? ^ 

AA(iGT1J'ij^uv_njui «>ju jrt^^i iLUAOAyviMAui i u i uai lAt^OTOCTTOTTQTCTTCGGTTTGATTTT 

cc 1 aggcctcAtgttcctgaatgcccttggaatcgcccg^ 

CD GTC&AGAfcAAAAAXGGTCGCTTCCra 
CT7 TrAjlCTTGCCCTTACGGTAAAAGATCCTCTTACA.GCCTTAA 
TA1 ^GG^^ATCTCTTGTTTAATGCAGGGATTAGCGTm 

.AA' 'AOIu iTtAGCAAGCAAATA ACCTCATATCTGTnCTAACTTGATTTTCCGTATGAAGAAAAATGCAOTl 
GG/ iCTAi JCCACC ATCGCTATCTTGTCAACAATGGTTTTGGTA ACCATGTCAGCAGCGACAAGCATTTTCA 
tS. ArrCCTt AGAAAGCTTTAAAAAAGTrCTAAATCCTCATGATTTTGGGOTTTCAGGGGAAAATGTTGAAAA 

* AG/ IAGA :TTGGACAMCTCTTGAGCCAGTTTGCAAGTGACAAAGG 

\U Ar TCG;lTACA.CTTACTTTGCTGTTGCGAATCAAGAAGGAACCAAGTTAACTAinrTT^ 
AA< £GTf<>TCGAACCAAAAACAGTTTTCATGGT^^ 

AA( 3TGT( ITCTATCAGGAAATQaGGTCGGACTCTTTGCAAAGAATGAGGGAGTTA/MlOAACAOAAAGCTC 
TAJ .CTCj] AAATGATCATCMTTtTCTGt AAAAGA AGAATTTACTA AAGATITTATTOTCaACCATGTTCCA 
Af' ^AG'TTAATATrrTGACTG^TGATTACAATTACCTIGTTGTACCrGATTTA 
Vfc. TtC CCA( ATTCG GCTATCTATAATCAGTTTTACG GTGGTATG AATGTAAATGCCAGTGA AGCAGAACAAC 
Tfo AGq' XTGCTGAGGAGTATQAAAAATACTTACAAAAGTrTAATGCTCAATTAAACACTGAAGGTA^ 
TGI GTAa GGTApCACTCTAGCAGATGCrAGTGCTCAGATGAGTGCCCTCTTTC 
GTV ,rm CTTATCCAH'ATCTrTATGGTCGGAACCGTTCTGGTCATCTACTACAAACAA^ 

T^AC^fe^ 

• AV , , ' ' ™ 
. AtC TGCj 

GCi • AA^CTCGjjAGCACCACC^f^ACCACeACi'GA 

^CJpi 83 , 'homologue of SEQ. ID NO. 7.9 rt 
AK AAAAAAGATCAACTAAAGGCTTGG 

AGCIClTtfinrrGTACGGATAAAGTTGGCGTCTrACCATGTGAGAAATGCCGAAGT^ 
<V ^^^JS^ACAGTTITCTCAAGCAGGGATTG^ 

?W 2T7G nrGACTAGCGATGAGGAAAAQATGTTACCGACAATCCGAAGTCGGAdTCAGATOn^TCAC^^ 

£p> TTA GCTA AGTTTAGrCAATCGCGAGCTGAAGCAGAAAAGTPGGCACATCA^ 
TCGATGMAGTGAACGCCTGCTGACTTGGTTAGTAGCTAAGAAAAAAG^ 
AA1 TAGf CAACTTGGCAGATGATAAGGAAAAACAGGATCAGGTTTTACGG^ 
G^GGA<:CTCTTGCAGGTAAGAGTAAGAGTGATTCTACAAGAr^ 

. 5 CJ^S^iJ'homologue of'SEQ. ID NO. 80" .' 

A "TC AAt? CATTTAA AAATTTCTTAAAAGAGTGGGOATTATTCCrocro 

;i^t nxjrtgttaaggacctccctattgaccgttttgatatrgtggtggcccatgagga^ 
cac:act^^^^ ^ 



o 



pi "homologue of SEQ.'.ID NO. 81" 
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c4 



^ijbcCAKGTGGATATTGAAGCGATTAGTQAAACGA^^ 

CGl 

<^'[QT<^TIttiU:QCTAXTAAAAAAATAGA 
AT< irr/jAACATO 

TT< 'AAA' rCGAGGAAGCGATGAGGGAAGAAGCAGGCGCTGATGACCTTGAAACTGAGCAAGACCAAGCTG 
AA \GTC iAGAACTAGA AG ACTTGGGCTTOAAAGTTGAAACOAACTITG 
AA jTAA rGGCtTATGTTCAAACGATrATTGATOAGATGGA7X3TTGAG 

OC( 3TCQ ■ rAGCATCA ATCTACA AATTGACACCAACGAACCAGGTCGtATTATCGGCTACCATGGTAA AGTC 
TT( f AAG }CCirGCAACTGTtGGCTCAAAATtATCTnACAACCGCtATTCCAGA 

^ ca vtgt< ;aat<&attatgtcgaacaccgtgcagaagt CTTGCAGACCTATGCGCAAAAATTGGCGACTCGT 

Gt" TTdC iAAG AAGGGCGCAGTCATAAAAC AGATCCA ATGTCA AATAGCGAACGCAAG ATTATCCATCGT 
Ar ATT CACGTATCGATGGCGTGACTAGTTACTCT^ 
ATaCAQKACT QGAQCaCCACCACCACCACCACTQA 

* , I I ' * ' ■ ' , ' 

2-CFJLH6 i hoinologue of SEQ.. ID NO. 82," 1 

atxbtcaX^^ 

GCj ICA A jGCTQCGGGTaTTTCTATGTTGGaACaTO 
ACj lGTA, LOAGTTGTAGGACACAAGGCAGAATO 
AC] CAA' TCTGAACAGTTGGGAAClXKJTeATGCAGTTATGA 
ga! :a£a :cTTu<STCATTGG^GGAGATACTCCTTTAATCACT 

CQATATi IAATCATAAAA ATOTGGCCACTATCTTC ACTGCTGAAACGGATAATCCTTTTGGCTATGGACGA 
ATTGTTC GTAATGA^ 

aaatcaaggaAatcaacactgg 

cmtaajaataacgctcaaggcgaatactatattacagacgtcattg 

Aju lGTI* XSGGCTtATACTTTGAAAGATTlTGATXSAAAGTC 
CA< iCTG> GTCaGTTATGCGTCGTCGCATCAATCATAAACACATGGTC^ 
AO* lAGG, \ACTTATATCGATATTGATGTTGAGATTGCTC^ 

AAi lGGCS :AAACGAAAATTGGTGCTGAGACTGTTTTGACAAAdGGTACTTATGTAGTGGACAG 
GGi ^GCA< jGAGCGGTCATTACCAATTCTaTO ATTGAGGA AAGTAGTGTTGCAGACGiGTGTGAGAGTCGGT 
CCI TATG CTCACATTCGTGCAAATTCAAGTCTGGGTGCCCAAGTTCATATTC . 
AG< I ATC: TCAATCCGTG AGAATACCAAGGCTGGTCATTTGACTTATATCGGA A ACTGTGAAGTGGGAAGC 
A A< !CTT> aTTTCGGTGCTGGaACTATTACAGTCAACTATGAC^ 
GAC ! ACA; iTGTCTTTGTTOGTTCiAAATTCAACCATrATTGCACCAGTAGA 
GO: OCTC GTTGAACTATrACTAAAGACGTGCCAGCAG 
" ATAAAq'i^GAATATGCAACACGTCTT^ 
CTCA 

aCFE87.» "hoipologue of SEQ. ID NO. 83" 
ATC TQC^AGA^TAGTATTTGGTCACCAAAATCCAGACTCA 

CTA CCTT 3CAA AAGAAGCTTACGGTTTGGATACGGAAGCTGTtGCCCrTGGAA 
GCC TTTG rCTTCAACT ATmGGTGTGGAAGCACCACGTGTt 

AGC AAG1 TATCTTGACTG ACCACAATGAATTCCXaCA ATCTGTATCAG ATATCGCTGAAGTAGAAGTTTA 

CGC T0TT GTAGACCACCACCGTGlTGGCTAACm 
O GTX 3GAi CAGCGTClTC^TCGTrrAOCGTATGITCAA AGA ACATGGTGTAGGTGTGCCTAAAGAGATTG 
^ CAC Gltt 3ATGCTTTCAGGTTTGATTTCAGATA 

AAA ATC/ TTGC?IXX:TGAATTGGCTG 
Vt AAC CTGC TACGAACTTGGCTAGCAAATC^^ 

CAA c6g> aata atgtccgtgttgcccaAgtgaacacagttgacatcgct GAAGTTTTGGAACGCCAAGCA 

G AA ATT<3 AAGCTGCAATGCAAGCTGCCAACGAATCAAACGGCTACTCTCACTTTGTC . 
ATA rCGt :AACf CAAACTrCAGAAATATroGCTCTTGGTGCCAATATGGACAA^ 
TTr< 5 AA A :tTGA AAACAATCAtGCCTtCCrTGCTGGTGCCGTTTCACGTAAGAA ACA A 



<0 



Ta 



7TGA \AGCttTTAATACGCTCGAGCACCACCACCACCACGACTGA 



I- 



? ^CFjCbi '.^homologue of SEQ. ID NO. 84" 

ATQ VTTTCAA AG AGATTAGAATTGGTAGCTTCCTltGTGTCACAGG^ . 
^ GTG VdCAjrGCTj?ATCTGCCTA^ 
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\ Gb+GGAJuK3TCCCTATCAGTCTO^ 

• CC< ilTTAGCCAATGGCTtGGCAGCm AGTOTCrGTCATTACCATTGCTOGCATQ 1 
N> GG'XjOTCGTTTtJATTGCTACKjATTrrAGAAOAAOG 

T^C !AGC( ICAATAATCGTGAAGACGACTTGCGTATCTGGC W 
^ AA< 5AAT :TTAGAAGA AGCTGGA AAGTITTATGAGATrTTGGTGGTGGAAGCAGGACAAATG AAGCTATC 
■w - AGCdX& XJATOTTCGOnrroGTCCCTTCTTGTCC^ 

AAaGAA jCTGAGAAGCTAGAGTTC^CCCTCGGACAAATCCCAGAAAAAAATCTGGAAGAACGTCAAG^ 

ctagtaoataacattcaagct^ 

acvga »] r# . " ; 

I " -I *'* ' . ' t 

ft CFi:» C "hojaologpe, of SEQ. ID' NO. 85" • 1 , 1 * 

atoaax taaacgatattaaagacttgatgactcaatttcac 

IGACGGATGAGTCGCAGTTTaGCAAGAATC^ 
mi TC^CTCCAGCACCCGTTCfAGCAACACCGAGTCCAGTAGCTCC^^ 
£ AOAAGA VGTTaCAGCTCCAGCTGAAGCAAGTGTG^ 

«1 GGAGTG< iTTTACTTGGCTGCTGGACCAGATAAACCTGCCTTCGTTACAGTTQ 

GTC AAA< :ATTGGTA ATTATCG A AGCCATGAAAGTCATOAATGAAATCCCAGCTGCTAAGGATGi3TGTGGT 
^ AM :Gf3A kATTCTCOTCTCTAAOGAAGAAATCGTTGAGTTTC 

AA0C£O( ACCACCAOCACTGA i 

• 1 ! . 

• i # 

"homologye of SEQ, ID; NO.. 8.6" 
ATC AAA< rTAAATGGAGTAGTGGTAACAGGTTATGGAGTAACATCTCCAATCGGAAATACAGCAGAAGAA 
. TTT IGGA ATaG'TTTAGCAACTQGGAA AATCGGCATTGGTOGCATTACAAAAtTTGATCATAGTGACTTTQ 
XlC TGCATAATGCGGCAGAAATCCAAGATTTTCCGTTC 
TTT IGaT \ACTATTCTT^ATaTGCCTTGTATGCAGCCCAAGAGG 

Adc fcfcT tjUtagggatcgtt7tggtgttatcgttck:atctggtattgg 

TCA GGT> CTTCGCCTTCATGAAAaAGGACCCA AACGTGTCAAACCA ATGACtCITCCAAAAGCTTTACCA 
AA1 A1TGC CTTCTGGGA ATGTACKXATGCGTTTTGGTGCAAACGGTGTTTGTA AATCT 
CTC TTCA rCAAftTGATGCGATTGGGGATGCCTTC^ 
TGC G AG( iAACAGAAGCTTCTATCACACrfCHra 
ACA GAG< JATCC,AACTCGTGCTinCGATCX:CATT^ 
CAC GGAv GTTOGTTGTAGAAAGTCITGAACACGCTGAAA 
TGG TTAC GGAAATACTTGTGAtGCXrTAC^ 
^ GCC ATCA AAGTAGCCTTGGAAGAAGCTGAGATITCTCCAGAGGAAGTAGCCTATO 
CGI GAAC TCCTGMXlAATGAA^ AGGAGAAAGTGGTGCTATCGTAGCTO 
Alt ATtjCA ACCAkGTCrrrTACAGGACATTTGCTGGGGGCTGCGGGTGCAGT^ 

GAA fiCW TGCGtGATAACTTTCjTACCAATGACAGCTGGGACA AGTGA AGTATCAGATTATATCGAAGCTA 

* ATG TCGT rrATGGACAAGGTTTGGAGAAAGAAATTCCATACGCTATTO 

CAC AATC cagttcttgctttcaaacgttgggaga 

fcCFiiJl * 'homologue ' o£ SEQ. ID. NO. 87" ■ , 
ATC AACTfTCTATGATCAACTAC^ 

CCT 3ATG rGGt]TTCAGACACCAAGCGTTTTATGGAGCTTTCA AaAGAAGAAGCTTCCAATCGTGACACCG 
. TAA TAGt CTACPGTGAGTaTAAACAAGTCCTTCAAAATATCGTCGATGOCGAAGAGATGATTAAGGAATC 
• AGC C^GiA GATG.GGGACTTGGAAGAAATGGCCAAGCAAGAACTCAAAGATGCCAAGGCTGAAAAAGAAG 

A at atoa agaaaaactgaaaattttgctxx:ttccaaaggatccaaacgatoacaagaatatcatccttga 

AAT CCGT 3GAGCAOGTGGTGGAGACGAAGCGGCACTTTTCGCTGGAGATTfocrAACTATGTACCAAAAG 
TATSCGGtaGQeCAAGGTTGGCGCTTTGAAGTCA^ 

AAC TGGTOGCTATGGTTTCAGGjTCAGTCTGTATACTCTAAGCTrAAGTATGAATCAGGTGCCCA 
rT^ CAA CGTGpTCCTOTCACAGAAAGCGAAGGCCGTGTTCATACTTC 

^ A AC TTGAkGAGGTTGAATACGACATroATCCAAAAGACCTTCG' ' 
TGG rGGA|CAGA!ACGTCAAf AAGGTTG<X3ACTX3CCGTTCGTATC^ 
. GAp fVTGGAGGj&GAACGTACGCAGCAGAAGAAC^ 

GGt jACckCTTjXjCTCAGAtTGCTCAGGATGAAGAAGACGCTGAGCGTAAGTC 
AC£|}TTC^GAACGGATCCGAACTTATAACCT 
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1 



°» IT. 

ir i 



J 



CC TCC/ AAAACTAGATACGATTTTGTCTGGTAAATTGGaCGAAGTTGTC 

AjUp^oaaa^ctagaagaattaaacaaactc^ • K 

^ClfejLv, ."hqmologuo of SEq. ID NO. 88" 
ATGGCdTACACTCTTAAAGCTG^ 

AC Acri acgg aattg aataccaagacgagattatcatcgtcg atgctgggattaaattcccagaagatga 

CI XGCT rGGTATCGACTATGTCATTCCTG^ 
. TX ITA A rCACACACGGACACGAGGACCACATTGGTGGGATTCCGTTCCTACTCAAGCAAGGAAATGTC 
TA TTITk rGCTGGACCGCTTGCCTTGGCTJ^GATCCGTGGGAAACTCGAAGAACACGGCCTC 

0C CAA/ ctttAcgaaatcaaccacaacaccgagt^ 

CG ACTC ACTCTATTCCAGAGCCTTTGGGGATTGTCATTCATACTCCTCAAGGGA AAATCGTCTGTACGGGT 
, OA CTTT M.GTTCGACTTTACTCCAGTTGGAGAACGTGCGGACTT(WA 
AA GGCjC TGCTjCTGrCTCCTGTCTGACTCGACA AATGCGGAAGTACCAACCTTTACCAACT^f GAAAAAGT 
CG TTGvrcAGTCCATTATGAAGATTATCCAAGGTATTGAAGGACGTATC 

/fT ATCT POCGTCTCCAGCAGGCAACAGAAGCTGCTGTTA AGACTGGACGC^AGATTGCGGTCTTTGGTCO 
W -TAT< JGAAXaGGCCATTGTCAACGG AATCGAlXrrTQGCTACATCAAAGCTCCTAAGGGAAC^lTrATC 
GA OCCf AATGaAATCAAAGATTATCCTGCAGGAGaAGTTCTTATC 

CT VTbG dagccctctctcgtatcgccaacggaacccacggtcaagtacaattacaaccaggtgat^ 
ta nqr# jtcttictagtcccatccctgga^^ 

CT< jGiG rCGA^GTTATCCACGGTAAAGTGAACAATATeCATACATC 
JS^ AA AAAC rCATbCTGCGCTTGATTAAGCC AAAATACTTGATGCCTGTCCACGGTGAATACCGCAf GC AA AA 
V£ Ab TCCA SGCTGGACTAGCAGTGGATACTGGTGTTGTGAAGGAC^ 
GT 3CTT< >CCCTTACTGCTGACTCAGCTCGTATCGCAGGTCATTTC 

AA \TOG rATGQGTGAAATTGGCGCAGCtGTCCTCAAAGATOGTCGCGATCTATCTGAAGACG 
CP IGCA jTCGCAACTGTTGACTTCAAATCGCAGATGATTCTGTCTGG 

. tt< }tct/ cato agagagtgtggagacttgattcgccaa agcc^gcgtatcctcttcaatgcgatxcgtatg 
gc *ctq, usaaataaggatgctagcgtgcaatctgtcaa' — • ~~ 
xc' atgilaaataccgaacgtgaaccgatcatcatoccgi 
ccAcca< x:accagcactqa 



4 A 



atc|k^Acgg~ca^ 

AA< SGCCi VAGACGATTGAAAAATATCTAGGGAGAAACTACAAGG 

GA* ttg> agaaatccagtatotccgtcgatat^^ 

GAi lAAG 3CCCTCTTATGAATGACTTGAAA AAAGAAGCTPAaAa AAGCTAATAA AGTITTTCTCGCGAGTO 
ACC ICGGi (CCGTG A AGGAGAAGCGATtTCTTGGCATTTGGCCCATAThrc 
,CCCJ' GTGChnCtrCAATGAAXTC^^ 

GAVAtGGACTTGGTCGATGCCCAACAAGCTCGTCCkSATCTTGGATC 

CW T7TT 3TGGjiAGAAGGTCAAGAAGGGCTTGTCAGCACGtCGCGTTCAGtCC^ 
CA1 TGAC CGTGAAAATGAAATCAATGCCTTCCAGCCAG 
A A( rGGA: kCOlAACAATtrCATGCTTCCTTCT 

AAC GAA< jTCAAGGAAQTCTTGTCTCGTCTGACGAGTAAAGACTTT^ 
GAC 1CGT/ AGCGGAATGCTfXSTTTACCCTATA 

ATI TCCG rACTtGAAAAACCATGATGGTTGCCCAACAGCTCrATGAAGGMW 
. TCAAGGlTTGA,TTACCTATATGCGTACCGATTCGACICGTA^ 
J>. AGC TpA ITAC^ATCGTTrTCGTACfcAAGtAra 



CTC AGGA TGOCCATGAGGCTATTCGTCCGTCAAGTCn"CT 
GGA CAA< 'GATCAGCTCAAGCTATATACCGITATCTGGA^ 

GTT rTTO, VTACOATCGCTGTTAAATTOTCTCAAAAAGGGGTTCAATrTGCTGCCAATC 

GTT K3AT 3GTrATCTTGCCATTTATAATGATTCfGACAAGAATAAGATG^ACCGGACATGGTTGTTGGAG 

ATG KJGT 2AAA^GGTCAATAGCAAACCAGAGC^CATTTCACCCAACC^ 

AAC ACTG ATTA4AACCTTAGAGGAAAATGGGGTTGGACGTCCATCAAGCTACGCGCCAACCATTC 

CAT "CApAAAGGTTATTATGTTCGCCTGG^GCCAAACGTTTTGAACCGACAGAGT^ 

AAT \ AGC TCATtoTTGAATAmCCCAGATATCGTAAACGTGACCTTCACAGC^ 

TGG *.TGA rGTCGAAGTGGGAAAAGAGCAGTGGCGACGGGTCATT^ 
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r (fitted 



AAQl'TCCdAA<KSCTGAAGAAGAAAT(K3A 

A! AGI GTGTGGTAGTCCAATGGTCATrAAACTTGGTCGTTTrGGTAAATTC 



.0 



TG^G^GTCTGOTAGT^ 

cA Gkn gccaS CATACCCAAX3CAATCGTGA AAG AGATTGGTGTTGAGTGTCCAAGCTGTCATCAGGGACA 
AATTATrGAGCGAAA^ACCAAGCraTAATCGCCTATrCTATGGTTGCMTCGCT 

AC CTCT rGGGKCAAGCCTGTroGTCGTGACTGTCCAAAATGTGGCAACTTCCACATGGAGAAAAA AGTCC 
U: OT 3GX^TGG^AGCAGGTTGTTGTAGCAAAGGCGACTACGAGGAAGAAAAGATGGCTCTTTGTCAACT 
GC rcp^ACCACCACCACC^CCACTGA 

teem "homologue 6f SEQ. ID NO. 91" , • ' 

AT 3TTTATTT(tCATCAGTGCTGGAATT G ^ 

ihr [TfrA- "AGA/^GGCGCAAATrACAGGCCAGCAGATGCATGAGGATGTCAAACAGCATCAGGCAAAAGC 
T G 3GA€ rcCTACAATGGGAGGTTTGGTTITCTTGATTACTrrcTGTTTTC 

AD TAG t CAATjrcAGCAATAATGTGGGAATGATTTTGTTCATCTTGGTCTTGTATGGCTTGGTCG 
AG 4T0 A CTTTCTCAAGGTCTTTCX}TAAAATaA.ATGAGGGGCTTAATCGTAAGCAAAAATTAGCTCTTCAG 
CTA< iGTGGAGTTATCTTCTATCTTTTCTATGAGCGCGGTGGCC 

CAT TGGfcATTTTTCTATATT^ 
l GAC jGTGTTGACGGTTTaGCTAGTATTTCCGT^ 
^ iu . GCA CGGTCAGATGGATATTCTTCTAGTGAtTCTTGCCAjTGAlTGGTGGTTTO 
I * TAAGCA rAAfiCCTGCCAAGGTCTTTATGGGTQATGTGGGAAG^ 
l > A1TCTCT/ ^TGGCTCTCCACCAGGAATGC^CTCTCTTGATTATCGGAATTGTO 

TG' TAT( JATGCAAGTC^GTTATTTCAAAGtGACAGGTGGTAAACGTATTTTCX^ 

AC :ATT TGAGCTTGGGGGATTGTCTGGTAAAGGAAATCCTTGGAGCGAGTOGAAGGTTGACTTCTTCTT 
Tf < iGGG, IGTTGGTCTTGTAGCAAGTCTrcpTGACCCTAGCAATrrTATA 
AC0aC5QaCTGA; ' 

£CFE9« "homologue of SEQ. ID NO.' 92" ' ' . 

AT$GCAtoCGAATTTTCACTTGAAAAAACTCGTA 

cL kCAACTACTGAGCGTATTCTrTACTACACTGGTAA AATCCACAaAATCGQTGAAACTCACGAAGGTGC 
GT<!ACA!^tGGACTGGATGGAGC^AGAGCAAGAACGTGGTATCACGATCACATCTGCTGCGACAACAGC 
TO ATG<BAACAACCACCGCGTAAACATCATCGACACACCAGGACACGTOGACT 
• AC<rTTCTCTT<2GTGTATTGGATGGTGe^^ 
AAj iCAGTinrGGCGTCAAGCAACIX}AGTACGGAGTTCCACOT^ 
CdGTGG' GACTaXX^tTTACTCtGTAAGGACACTTCACGATCGTCTTCAAGCAAATO 
T6C CAAT CGGTtrCTGAAGATGACTTCCG.iroGTATCATTGACTrGAtCAAGA.TGAA^ 
TAJ .COM JCTTGCTaCGGATATCCTTGAAGAAGACATCCCAGCTCaX 

CGI GAA> .AATTGATTGAAGGAGTTGCTGAAACTGA CGAAGAATTGATGaTGAA ATACCTCGAaGGTGAA 
GA> ATCACTAACGAAGAATTOAAAGCTGGTA^ 

TGI GTGC TTCACCC1TCAAAAACAAAGGTGTTCAATTGATGCTTGATC 
CO CTTC ACWTCCCAGCAATeA^ 
W ACC AAQ^ tGCCATOraCAGCTXTTGCCTTCA AGATCA TGACTGACCCATTC^TAGGTCGTTfGAC^TTCTTC 

CQ7 GTTT \GTCAGOTGTTCTTCAATCAGGTTCAT^ 
/fc>> CGC ACGT ATCtfTTCAAATGC^CGCTAACAG^ 

GCTteGCC ttgc?ittgaaagatactacaactggtgactcattoacagatgaaaaagcta^ 

AGT CAATZ CAAOvTTCCAGAACCAGTTATCCAATTGATGGTTGAGCCAAAA 
GATGGGIATCGCCCTTCAAAAATTGGCTGAAGAAGATCCAACATTCCGM 
GG7 GAA> CAGTTAICTCAG^TATGGGTGAACT^^ 

CAW AGTI 0 A AGCG A ACGTAGGTGCTCCTCAAGTATCTTACCGTGA AACATTOCG CGCTTCTACT.CAAGCA 
CGC QOAl TCTTX&AACGTGAGTCTGGTGGTAAAGGTCAATTCGGTO 



agc odrr qaaaIXaggtttggtagaatctatggctaacggtgttcttgcag^ 

AA!A QGTA AGCinTATGATGGtTCATATCACGAtGtCGACTCATCTGAA ACTGCCTTCAAGATTQCGGCTrC 
ACT rrc6:TTAA>GAAGCTGCTAAATCAGCACAACCAGCTATCGTTGAACCAATG 
ACT 3TTC 2AGAAGAAAACCTTGGTGATGTTATGGGTCACGTAACT^ 
TGG \AGC ACACGGTAACAGCCAAATCGTTTCGTGCTIAC^tTCCACTTGC 

ACT rcn?< XSTTCTGCJATCT^AGGAajTGGTACATTC ATGATGGTATTTGACCACTACQAAGATGTACCTA 
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* ^"AWTCA(iTACXAQAGAAATTATTAAGAAAAATAAAGGTGAAGACCTC 



a cf ii 



homologue of SEQ. ID NO. 93" 



AT 3GCA A ATTACaATATTCC ATTTTC ACCGCCTGATATC 

CC ;TGC jTTC^GTTGGATCACAACAGGTCCTAAAACAAAAGAACTGGAGCGCCGCTTGTC 
AC \GAC ACCTMGACTGTTTGTCTCAACTCTGCGACAGCCGCTC 
TG aoACCTGGMATOAAGTCATCfiTTCCAGCCATGACCTATACGGCTrCATO 

gg 3 agc aagocctgtcatggtggatatccaXggapatacgtttgagatggactatoacctgctt 
CC tatc vctoagaaaactaaggtgattatcccagtagagctcgcagggattgtt^ 
tc rrcc lagtcgtggagaaaa AAGGTGACTrCTTTACCGCTTCA agcaagtggcaaaaggcctttaaccg 

7a{tGT< JATTGTCTCTGATAGTGCCCACGCITKKjGATC^ . 
Crp AiTT TACTiTCCTTCTCATTCGA^ 
^ AAKCJCC AATCCAGTGATTGATGACGAAGAGATGTACAAGGAATTCC2AaATCCTT^ 
fee crMGGATGCTCTTGCCAAGATGCAACTGGGGTCATGGG^ 
\ V> CA \CAt 3ACCGATATCATGGCTTCACTTGGTTTGGTACAATTGGACCGCTATCCA 
1 Ofr %AGQ ACATJGTOGAOCGCTATGATAGTO 

TOi VAAC rGTCQAATCTTCACGCCACCTCTACATCACCCGTGTAGAAGGAG 

cr< :ato ^tccaagaattggctaaagcaggaattgcaagtaatgtt cactacaaaccgcttcctctcttga 
c^gccx ita^gaatcttggarrtgatatgacgaactatccta aggggtatgggttgtttgagaatgaaatt 
a£cctc< jctctitcatactaA attaagcgatga^ 

ctc ; aaa wvgt6ctaactttatcaaaaaaactcgagcaccaccaccaccaccactg 

* * • * *■ * * 

I * • • , 

* ■ * ... 

fiCFU»$ . "homologue of SEQ. ID NO. 95" , 
AT( yrjTtATACTTATTTGCGTGGATTAGTTGTAT^ 

Ak 0 "ACT( }ATAAAATtCCTAATCA AGATGAAAATTATAlTITAGTrGCGC 

CTC rTTT^i TATGGCCTTTGCGACCA AGCCAAA ACAGTTC ATCTTTATGGCAA AAAaAOAACTCTTTACCAA 
^ ' CC< rTAT< ITttGGTrGGTGGATTCGTATGTGTGGCGCCTTTCCC^ 

^ CC> TCA; ATATCCTATCAACGTTCTCAAAAA AAGTG AGCGCTCTCTCATCATGTTTCCAAGTGGTAGCCaC 
. * . CA< JTCAJ ACGATGTCAAGGGGGGCGCAGCACTGAYTGOCAAAATGGCCA^ 
^ ACC TAGi .CCGGTCfebvTGACTTTGAAGGGCTTGATTAGCCGTGAACGTGTCGATATG^ 
V{; CA/ TCtii T ATCJTCAG ATATCAAG AAAATGA ATGATGAAGGCATTGAAACAGtCGCCAATCGTATTGAAA 
CAC i AATf raAACGTCTGGACGAAGAAACGAAACAATGGCACAAT 
G<5- TTAflCCGcXTCCCTGCCCTCATCCTI^TATTATCGTCGCTATCC 

CAi QCVn CATCTGGAACCCAGATAAGAAAAGAGAAGAACTTGCACTnCGAGCACCACCACCACCACCACT 
GA . I i! 

ZOF)EI01.i A homologue of SEQ. ID NO. 97" 
ATC ACC^CGAAtTTtTACATTTTGAAAAAATCA 
CAC CtCC rTTGACAGAAGAAGAATTGGAATCTATCAAGAGT^ 

TAG AGA1 AT CTATCTOCCGTTX3GCrCATCTCATTCAGATTTACA AGCGAACTAAGG A AGATTTAGCCTTTT 
CAA AXfiC IAATT()nrCCTCCAACGTC 
v GTI 3<BA> AATCCACAACCAGTCGCCTACTTCAAATCCTaCTO^ 
Vi AQ1 TGG1 TACAACTGATGGTriTCTCTATC^ 

QQA THTC CTGAAAGCTATGATATGGAAGClCTTPGTCAACTTCTro 
TAOAtAl rC^TGTCTATTCTGATGAAGrrTACGACATCGTACCCGA^ 

TgI: TGAGiCTAGCCCAAAACGACCCTGATAGG^^ 
AAaTOG/AGCGffTTGCCCATCAGGlCTGGAC^^ 
AAC C^aJ* AATCjGTGCAG AAGTGATTCTTCATAA^ 
AAA GCTC G AGGACCACCAGGACCACCACTGA . 

T • n • • . • ' . - • ' ' 

SLCFf tOi "homologue of SEQ. ID NO. 98" 
^ ATG jAAATTTCATTaTTAACAGATGTTGGTCAGAAACGAACAAATAACCAaGACTATGTCAACCaCTATG 

jck ' lCh ' ^ T AGf gc^gacgtaccatgattattttaqctga 

& ' " .. .. 



.■^ TGAllTTT STAAI^GTTGAGGGAATCAATOTCTTrcAAAATCCACAAAACGATCGrGTC 

nTn^CATWATGTAGATGCTGGAGTGGATGATATTGAAAGTTGGTATCT^ 
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CG 



3 CF£ /o2- 

£ TAG TGA AATOOCGGTCACAGACCTGGGTGTAGCTrcGGTTGATACCCAGATCGATACAOTCAATOAAQTO 
5 OG TG A/< TGCTjtCGCCCATTACCTAGA AATTGAA A ATCAAA AGATTCACCAGCTTGGTCAGGATGA AGCTT 
XsL AC \GAf GCA^GGGAACTACTrrGGAAGTCCTTGCTATTATTGATAATCAGGCTATCTATGCTC 
N> OA TTCG CGTATCGGCTTGATTCGTGG AG AAG AATACCATCAGtTQACGAGCGATGATTCCTTGGTTAATG 
^ AA TTGfc rCAAGGCTGGTCAATTGACACCAGAAGACK3CAGAAGCTCATCCGCAAAAAAATATTATCACCC 

ag tcta nGGGCAAAAAG atgaaattcagcctgattttgggacagttat<x:ttgagtcag^ 

k . .... 



I 



^ ctJtc^t|laatagtgacggcttGaccaacatgatttcaggcagtc 
at rdpf [tagcagataaaacggagacacttgttcgttttgctaac^ 



STTG^CCflrilGTrrCTATO 



TEG 



£ C*ill03'{« h hom6Xoquew Of SEi 



•99 



If 



v». r w-m GHl o i0 quew.ot seQ» ID NO, ^, . 
AT jACGATACAGATGAAGAATACAGGTAAaCGAATTGATC 

AG 3 QTtWgtAtGA ATTGCGAGATCGTTTGA AG AGAA ATCAGTT^ 

TT< JTCA' TTCX2ATTGGCGGGGATGGTATGCTCTTGTC 

GT :CGG rTTATCGGTCTTCATACr GG ACATITGGGCTTCTATAGAG ATTATCGTG^ 
^ GC TAGT jACTAATTTGCAACTAGATACTGGGGCAAGGGTT^^ 

X. TTl J AAA VTGGTG AAGTTA AGATTTTCAG AGCACTCAACGAAGCCAGCATCCGCAGGTCTGATCGaaCCAT 

^ Gb rcjG* vgatattgtaataa atggtgttcxxtttgaacgttttcgtggag acgggctaacagtttcoaca 

CQi 3Afct< JGTAGTACTGCCTATAACAAGTCTCTTGGCGG 

V£ AT "AAQ sgagattgocagccttaataatcgtgtctatcga 
AG 3ATA vgattgaacttattccaacaagaaacgat^ 

ttc :cgtv ATAlTGAGCGTATTGAGTATCAAATCGACCATCATAAGATTCACTTTGTC 
TA< JCAG TTCTGGAACCGTCTTAAGGATOCCtTTATCGGTGAGGTCGATGAA 
CX(:CAQ'X1A . ^ . * 

QCP):irf4 , ^'homolog^e c&f- SEQ. ID NO; 100" 
ATi fTCaL LAAG AAAFTA AATTTTCATCAGATGCCCGITCAGCCATGGTW 
AC; iCTG' TAAAGTAACCTTGGGACCAAAAGGTCGCAATGTC<m , CTTGAAAA 
GA* TAG( lAATGAGGGTGTGACCATTGCCAAAGAAATCGAATTGGAAGACGATT^GAAAA^ 
AA< 3tJA< jTATCAGAAGTAGCTTCTAA A ACCAAT G ATATCGCAGGTGACGGGACTACGACTGCA ACAGTQT 
TQJ .Cffid ^AGCTATCGTCCXH'GAAGGAATCAAAAACGT^ 
T<J( rGATf UA AACAGCAGTTGCCGCAGCAGTCGAAGCTTTGAAAAAC^ 
AGj LAGCS rATCGCTCAAGTTGCAGCCGTATCTTCTC^ 

ATC GA/J, lAAOiTTGGCAAAGACiSGTGTCATCACC^tCGAAGAGTCACGTGGTATGGAAACA 
Gtt GTaX fAAGiAATGCAGTTTGACCGTtJGTTACGTTTCACAGTACATO 
TGC ^CTGj^ rCTTGAAAATCCGf ACATTO 
ACTT^GGAAAGCATTCT^ 
CTTjCCAA CtdrrGTTTTGAAC^ 
^ TQACCGG CGCAAAGCCATGCTTG AAG ATATCGCC ATCTTAACAGGCGGAACAGTTATCACAGAAGACCTT 
GG1 CTtC AGTT£ AAAGaTGCGACAATTGAAGCTCTTGGTCA AGCAGCGAGAGtGACCGTGGACA AAGAT. 
■ £> AQC ACGK nrTATTOTAGAAGGTGCAGGAAA^ 
V V AAJ TC$gI AACTaCAACTTCTGAATTTGACCGTGAAAA^ 

TGT AGCC GTTATTAAGGTTGGAGCCGCAACTGAAACnX3AGTTGAAAGAAATO 
TGC CCT£ AACG£TACTCGTGCAGCTGTTX3AAGAuAGGTATTC 

GTG ATTf CAGClGTTGCTACCTFGOAAr^^ . 
CTT rGGA AGA ACOCGTTCGTCAA ATTGCTCACAATGCAGGATT^ 

AAA AATC fCTGAGCTTGGTATAGGATTTAACtoCA . 
AT»C ftTTTG ATCCAGTTAAAGTGaGT CGTTCAGCC^ 

CAA cj&qV AGCAGTCGTAGCCAATA AACCAGAACCAGTAGCGCCAGCTCCAGCAATOGATCGAAGTATGA 
TOG GGGfc GATGATGCTCGAGCACCACCACCACCACCACTGA 

■ . W ii • ■■ • . 

JLCFteLOS "homologue of SEQ, ID NO.. 101" 
ATG fcTT^GATTGAAACCGTATTAGATATI^AAAG 

^ GTC \TT^CACTACAACTACAGCAAAGfTATTTTTGATAGCATCAGCTA 
AG^A CACTCTTTpT^ 

^ TAGpTTqjSTAT<j™ 
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jo cf€ Ao<u.e /A 0«< f&i /o mo; /of 

ao< ;cat< sagtttgattgccatggagttctatggtaatccacaagagaaact^ 
og' "acx ^ggotaagacaacagcaacctatntxj<xtataacatcitatctcaagggcata 

TO^GTXjGACG^AAeACAACTCTTGATC^ 

TTTOAGATGATGAATCAGGCTGTGCAAAATGACCGTACCXJACCT^rCATGGAAGTCTCC 

rATCTAGTCCATCGAGTCTATGR XC.THA rrTTTft A Ttvrn rtn * n-nr~m~rr»wT. • . — 



GA 
C 



^ TO'TGTOGAO 

$ TA r TGA< JCtCTi xjuaca IXiA I OAATCAOUCTGTGCAAAATGACCGTACCCACCTCArCATGGAAGTCT CC 

. < Ac gaa scctatctagtccatcgagtctatggactgacc^ 
\3, ig> iCtA" 'atc(5gcccg attgaacaccctagctttgaagactatttctaccacaagcgtci<^^ 

aa' agc :gagg^tcatcattaacagtgacatggaccacttctcagtcttgaaagaacaggttgaagatc! 
£ aa< }ac<? \tgatnnrctatggtagccaatttgataaccaaatcgagaattccaaa 

^ AgrtOOT- ^AACtCGCTGGAGATTATGATATCCAACTCATTGGCAACTTCAAGCAAGAAAATGCAGTTGCTO 
^ C|X IGAC TGCTTGTCTCCGTCTCGOAGCAAGTCTTXUGGAC^T^^ 

.CTQG' ■CGTATGGAAGTCCTCACTCAGAAAAATXjGAGCCAAGGTCTrCATCGACTATGCCCACAATGGG 
:AGT( -TGAAAAaACTCATCAATGTGGTTOaA^CTCATCAAACCGGA 



•A' 'ACT* rTGAAAAAACTCATCAATGTGGTTOAAACrCATCAAACCGGAAAGATTCCTCTGGTTCTGGGAT 
A> iCAG 3AAACAAGGGAGAA AGTCGTt GTAAGGACTTTGGCCTCCTCCTCAAtCAACACCCTOAGATTC 
A( nrtf. TCTGACTGCTGATGACCCTAACTATG AAGACCCAATGGCCATTGCAGATGAAATTAGTAGCTA 
c\r. w CATCCTGTTGAaAAGATTGCGGATCGCCAAGAAGCCATCAAGGCGGCAATGGCTATCAGAaa 
tc; CGAv iTTAGATGCAGTTATTATTCCGGGTAAGGGAGCCGATTGTTACC 
GA, LTpC 'ACCCAGGAGATACAGCCGTCGCAGAAAATTATTnTACTCGAGCACCACCACCAt 



^ l »*«» C I ACCrAOCCAAAGACTTA ATCTTAGATGGGGAAGAAGT ^GG^GTG^GGTTOTAGGACfcA 
TCaYOAlATCGlTCGGATAAC^GAra 

GJ Jgga< !TAGACCTCAAACGCTATATCAAGGAACaTTATCCTCTTTCTAATGA>sS 
TQdGACi.AATTCTCTTGG^ATGCGCTTGGCCCATACTTOAGG 

A M r ^ !Cm tP A ^^AGATGGGACTGCX:AAGGTCACAGACTTTGGGATrGCrGTAGCCTTOCAG^ 
AcAA<feT<:TCACXCAGACTAACTt:GATGTTC^ 

CoXAGdOGACTGTGCAGAGTGATATCTATCCCATGGGGAtTArr^ 
CCC mTQACGGGGATAGCGCGGTGACCVVTTC^ 

CA<A A ^aTCATCTGTACCTCAGGCTTTAG^ 



TOdcTAjcGCTCGGriTCAGAGATGTATGTAGACTTGTCTAGTAGCTTO^ 
.^AATCTTTGATGAAACGAGCAAGOCAGATACCAAGACCm 

5? a ™£ taato ™^ a gcc^^ 



GAC IATC1 ATTCCTA AGGTTCA AGCGCAGACAGAACACAAATCA ^ATCAAAAACCC^G^^GCTO wap 
^ ^ ^ 1 AC ™aVUa*CAAGC*CX:GA A AAA^^ 
^ AGC ICTTC TATTGGTGGCAGCTTCTCTTATTTGGATACTATCCAGAACT^ 

^ ZVFS?- qgtcaga ^^ ca gagg^^ 

GA/ GACi ^AGG^AGTGAAAAGGTGGAAGMGGGCGGATTATCXJGTACA^TC^ 
TOG AAA/ GAA6GAACGAAAATTAATCTGGTTGTCTCATCAGGCAAACAA 
GTC GGCC GGAAATCTTCTGATGTTATCGCGGAATTAAAAQAGAAAAAAGTT^ 

TTG AGG/ AGA AGAGICGAATGAAAGTOAGGCTGGAACCGTCCTGAAGCAAAGtoA^ 
ACC TATG ACTTQAGCAAGGGAACTCAAATTOTTTTGACAGTAGC 
GG/ACTATATTGGACGGaACTCTACAG 

TGA TTAA GATaIGAGGAAGA AGAGTOCAGCX3 AAAGCX}AAO:AGGAaC^^^ 
GCC GG& .COACTTATGATqTCAGTA AACCT^^ 

ttd :cat mcgagttacattggttccagcttcgagtttactaagAac 

AAC GAA( ICTAATATAGaAGTTGTAGAAGTGACGACAGCGCCTGCAGGTaGTC^ 
^ ^ TCC TAGAGCAGGTGAAAAGGtAGACCTAAATAAGACTAGAGTCA^^^ 

car *AAA|CAAcjprcAGCTACT^^ 

ZCvAdi v "hortK>logvte of SEQ. it> NO. 103" 

tta •gga^gaiaaaccctcctcatcgaaaaamtcggaaacttggaaaaaAattag^ 
gmIaaga^aa^actgaactaggtggtcaagttgctactcaaatagaa^ 
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* , 1 \ * • * 

GA KJAC ^AGTTTGTXICTTAAGTCAACGGATCAAACCTTCACTTGTAAGAAACrcArrGTCACAArAnnTn 
- ^V^TTATCCTTCGACTGGITCOACTGGCTTO^ 

\& 2° ^$4[ GTGACCCTAAG ^ ATGQTAAG ^ TQ ^ TCAC TCATGA^ 

^ CA 3GTC n"GCTGGCCTACGCATGTCTAGCrrrGTCAAAGGTGGGGAGGTTCTCTCA^ 
^ CA\OT:CTGAGAAGGACTTGGTTACATTT^ 

^ ^CCT rGTTACCAGAACGCTTGGCCGAATTlTTrGTACAAGGArATCCTGA 

if Ctr li a ^OTCCTTTGTTaCCAAGGGTGGAGT^^ 
* A 3CfoG STACCTGGCCTCCACTITCCAGGCGAAG^ 

fiCP|lO« "homplogue of SEQ. ID NQ.;104 ,, ' . . 

^ ^ AAGTCrGGTGAC " CTGCTAA CGATAGCCGTATTACC^ 
TCC GTrt GGACGAACTGCCTCAATTGGTGAATGTCC^AAAGGTGAGAT^TCCTTTCTC 

£.CFi:*Q> "homologue of SEQ. ID NO. 105". * 

* • . 



cvi 



^CFEll;i ''homologue of SEQ. ID NO. 107 



goJaatc cgta^cgc^ 

S££ ^GAGjCCAATCAGGTCAGClTrACCTAaAAGCAGGGGCT^ 
TTG 3Tfc C toTTTTCCGTGCTGAAAAATCAAAAAC^ 

*k aoWtgactacgagcatcttcagcatg^^ 
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ife "hoznologue of SHQ. ID NO. 108" ' 
CG(jTTaIuAAATGCTAAC 

ga-uuvcLvctattggtc^^ 

ca/ ^gktraatggtaagaaatcgcgtga acaagctgcggaattgatkxjtcgaatccagatgarmcc 

AAOACCpTGC&CAAGTTO 
CG' TTAT^A/jbGATGAAGAAG 

CTC AGCACTTdACTCGTTACCCTC^TGAATTCTCAGGCG 

majci tgcaaccagactttgttattgcagatgagccaatttcagc 
ag' cttc aacttgctcaaaaaattccaaaaagagctc^ 
c6< ittc tcgctttafttcagatcgtatcgcagttatttacaagggtgt^ 

GAj KQAA rTQTWAACAATOCAATICACGCA 

Ml X2TTC GAACPGTA AGAAGGTCTTGA AGGTTTACGACCCAAGTCAACACGACTATGAGACXGATAAGCC 
At ( TATC GTAG AAATCCGTCCAGGTCACTATGTTTGGGCG A ACCA AGCTG AATTAGC ACGTTATCAAAAA 
GG j iCTAi lACCtCG^GCACCACCACCACCACCACTGA 

2 CFllijl . "hbmolocme of SEa, ID'-NO, 109" ' • 

ATC ^AAdATAAGTTGGAATGGATTTTCTAAAAAATCATACCJ^ 

OC< iCTCC TTAGTCCTGAGaGACAAGCTAGTCTGGAGAAGGAT . 
Gki iCTO; L GTGAGAATGTGGMOOAAClTlTrc 
TCA GGJ\J L TACACCGTTCCCTATGTGACAGAAGAACCCTCTC 

ATC ATFC'A AGCGtGCAGaTGGTTTTACTGCACAAGTCCATCAGCGCCAGATGATTGGGCA 
ATC AAG1 TGCTp^TCCTAAACTAGCGCAAOAOAAGATTOCCAGCA^ 
CCJ ATC> AGCCTArcCTTCTATCG 
A*( iCGA- LC^6ACTTTCTCGTTGf^ 

AjVC ACCJ .TGCTGGAAGCCrrGAAACCAGTCTTAGAAQAACTCAGTC^ 
CTC TCC/\ ACTAtGCGACTGATTClTCTGGTGACTGCAAGCTGTCGCA 
^ GGA' CA A&GACG AGAGATTGCGGAG AAA ATTGCGTrGGCTAGTCAGTTTGCGGAGGCTGATCCTTA 

f CCC AGC3 GQTACTCATAATAAAGGA ATTTTTAATC^ 
GGC GTGC CATCGAAGCTGGGGCCCATGCCTTTGC^ 

G AC GCTC GaCCTTGAAAGaGAAG AATTGGTCGGtGAGATGACCCTGCCCATGCCTGTAGCGACTAAGGG 
TGG CTG7 \TC<jfecCTCAACCCACGTtjTAGCTCTCAGTCATGATCTACTAGGAAA 
TAC CCCJ4 GAT^TCGTGTCCATCGGTCttGCTCAAAATT^ 

ATC CAGC AAGGCCACATGAAACTACAGGCGAAATCCCTAGCTCTCCTAGCTOGGGCTAGTOAATCIXSaAG 
TTCCTCC :CTAGTAGAGCGCCTCATCrcAGATAAAACCTITAACCTA 
AAATTTAAGATCA^COGCCXiCACTCGAGCACCACCACCACCACCACTGA 



h: 



A^GGAC^GAclftrcGAACT^^ 



2CFEI14 "homologue 'of SEQ. _ID 
ATG CGAaTTACJATCATTAGA Ai 

AAC tcg/ tgaa^ttitagatatixstggttcgtgattacgaaoatcttgtgcgt^^ 

TTT 3CGT VTTAAGAGriTAGAAaAGCGTTTCTCTTACnrTTGATGAAA 
^ TAT TOAT rGCTGAGGATACAGeTOAGAGAGTGAAACAGGCGGCGCATG^AACGTTCAA^ 
. ATCM.aCAGAGCAAGATGCGCAACGCITGTTGGAAGAAGCTAAATAT^ 

^ AAC CAAC tgataatgctaagaaagtcgctgttgaaacagaagaattgaagaacaagagccgtgtcttcc 

jtw ACCAACCTCTC^AATCTACAATTQAGAGTCA^ 

Up^- CGT :CAA CAGCrrACTTATCrTCAAACCAGTGATGAAGCCTTTAAAGAAGTGGTTAGC 
. AAC CGA1 rCC ACKTTCCVUVtTG AAGAA 

AGA ATTA CA AGCTCGTATTGAGGTaGCCGATAA AG AATTGTCTO A ATTTGAAGCTCAGATTAAACAQGA 
AG^T 3GAA ACTqi^CrCCTGTAGTGAGTCCTCAAGTTGAAGA AG AGCCTGrGCTCATCXJAGTTGGCCCAA 

• TGtt", VT^|i| \gaaJccagaagctc^agcaccacjgaccaccaccacto 

^•aCFELlS ''homologue of SEQ. ID. NO. 111" 
^ . ATG rGTAXTGGXCAACTAATrrATTTAATGGTTC 

^ AAT ^TT^TAC^YAAGC^AAACGAGGGGAGATrAGAGGCGCTAGAAGAAAGAAAAGAAGAACTATACA 
^ ATC|rf<X^TA>f^TC^ 
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' jets 

CcttGAkTGGXATCAAAAATGGGTCGATTTATCTCTCAACTCTTO 

AA< 3CA<3 AAGGCTATA ACCATTCAmCGmTCTCAAGGCCAGTCATCA AATTGACCAAATTGAGAGTCA 
a a rcM [TrGATTGAAGAAGATATTGCGGCAATTCGCAATGCTT^ 
AaLaaTAGTGGTCGTGTTCTTCATW 

agESa3tatggtcaagccttggatgaaattgaaaaacaattagaaaatatccaatctc 
ittotaaccttgaattcatcgggtgaccctc^ 

rnfGGX 'CTTA'AGTCATATKiTGGATCGTGTTCGAGCCTT 

CAATTAi^AGGAmGGAAGCCGGTtATCGTAAACTAATTGAtGCTAATTATCATTTO 
TGAA'GG SCGTTTCCACTTGCTTTAT^^ 

GA :AAT 3CCGaATATGAGAATGGACAGGCACAAGAGGAAATCAATGCCTTGTATGATaTTTTTACTCGAQ 

aantgctgctaagaaagtagtggaaaatctacttgcaac^^ 

ga vtaaItactitattgggagaAgatattgcacgtttga acaagacctatttacttcctgagacagctgca 

ag' jcatottcgtcgtatn^gacaga^ 
aa<3iaA€I\acc!aaoccaagcttattcagttcitgaag^ 
aTj ittgaagatgagcaaatttcagttagtcagcgcctc 
^ gt< :aaA\ggccaatgtttatgtcaatcgtgtccatactatcaagcgatacatggaaaaacgcaatctg(X 

^aglstattccacaaactitct^^ 

ta< ja1ac, ^aaaaatgattaacattgaatctgttacccgagttct 

TTI AG" A/ lACGGAAACITaTAATATTOTACaATATGCAACITTGACAGAGCAACT C^ 

CG< TATX igctcattksatgaacgcattcaagaagcatttaacgaagctttagatat^^ 

TTC ATT/ TCACGCnTCATTrGACAAGATTTCTCAAGCATTGGAAGTGGCAGAGCCTGGTGTAAC^AATCQ 
cm TGTT ACCTCATATOAGAAAACAOGTOAAACGATTCGTTTT^ 
CAQCACTOA || . 



Q.CFIH6 " homologate- of- ..SEQ. ID- NO. -11 2," 
ATC iClrrA^ATOYTATAAATGGTTAA^ 

AAv \VOT( lAACTACAGGAATCGAGGTAOAG^GtOTCGA^TCACCAOCTGC^GGTQTCTCAAAAATTQTCQT 
CG< !TGA<K}TCTTGTCTTGCGAAGATGTGCCAGAGACTCaC^ 

gaj :gtcaqatggtttgtggtgccccaaatgtocgtgctgggatca Aggt catogtogc 
ga( icxgc itatggctgataactacaaaatcaaaaaaggaaaaatccgtggtttc 

TCI GTTC /VCrTQGTGAATTGGGAATTTCTGACTCAGTTGTC 
TTCCCTGA.AGATGCCGTGCCAGGTGAGGAAGTCTTTTGTTACCTAG 

XTTjCCAT. cacaccaaacc otgcagatgccctttctatgtgtggagtggctcacgaagtggcagccatcta 

TCf CAA< KKlAGTCAACTTTAAAGAATTTACICrAAGAOAAACTA 
GTC AGCfc TTGAGACAGAGAAGGCGCCCTACTATGCAGCTCGTA 

GTC CAC& ATCqfiTGCAAAACCTTCTCATGAACGAAGGAATCCGTCCCATCAATAACGTAG 
CAA CTA< ATCCtGCTCTATTTTGGTCAACCAATGCATGCCTT^ 
TCC GTGT QCGf)3AAGCGCGTGCTGGTGAAAAATTC 

GAf TGAX ICTAGTCATCACTGTtJGCAGACAAGCC^GTAGCCCTTGCAGGTGTCATGGG^ 
GA/ ATC7 CTGAAAAATCTAGTCGTGTTGTCCTTGAAGCTGCTGTTTTCA 
^ AAC TGGTCGCCTGAACCTTCGTTCTGAGTCATCTTCTCGCTTTXjAAAAAGGAATT 

^ AA1 GAAC iCCCTTCATGCGGCAGCTAGCCTGATTGCGGAACTTGCAGGTGCGACGGTGCGTAAGQGCATC 
GTT rCAG KiGGTGAGCTTGATAGTTCAGATGTAGAAGTTTCTTCAACCCTTGCTGATGTTA^ 
CdC AAC1 GAGCTGTCTTATGCTGATCTAGAAGACGTCTTCCGTCGTCTTGGCnTro 
C At ACAC CTTtACAGTGAGAGtCCCACGTCGTCGTTGGGATATCACAATCGAAGCTGACCT 

a at tgct ^tatxjtatogttatgaccgcttgccaactagtctaccaaaagacgatggtacagcaggtgaa 
. ttg kcag ccacacaaaaactccgccgtcaagttcgtactattgctgaaggagcaggtttgacagaa^ 
tca 3cta ractctaacaactcctgaaaaagcagttgagtttacggctcaaccaagtaaccttacgg^ 
CATjTGCj ccaatgactgtggatcgttcagtcctccgtcaaaatatgatttcaggtatc 
. cct, ^caacgtgpctcgtaagaataaaaacttggccctttacgagatrggaaaagtcm 

TAA rCCA AAAGAAOAACnTCCAAATGAAATCAACAGTTTTGCCm 
AAlAGATI TCCAAACAGGAGCAGTTGCAGTTGATTTCTt^ 
TCG CTTG< JGACtCCAAGTAACCTATACAGCAACATCTOAA 
. GTG ^TTreACTG5G<5TGAC!CAAGTTCTTGGTTT^ 
TAT [CCAGAAAfcGTATGTGGCTGAGCTTAACCTro 

TTT< iTAGjkAATcACCAAATTCCpGGCAGTCAGCCGTGACGTTC . 
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TCCAGAA 
TC 




SL CFEH7I "Kbihologu.e of SEQ. lb NO .' '113" 

atqttasaaaacgatattaaaaaagtcctcgtttc 

^TGCTPAA^CTAMGACTATGCAGOAAAAAATCCAATCTrAGTTCGG^^ 
CT rTTA; XTCCroAATTCGTC^CATATTGATACAC^ 

' 0 CA rGGT SGA^GCA AGTAGTGGTGTTATCAATATTAAACAAGATOTCACTCAAGATA 

C CA FGTT :tatt^tagaagatatcattgatacaggtgaaactttgaag^ 

AAA.QAC AAG(^GClTCTGTTAAAATrGCAACCTTGtrGGATAAACCAG^GQAS 
. £\> AG GCAC ACTAjTACCTGGTTTACTATCCCAAATGAGTTTGTAGTAGGlTTJ 

SS>g1 
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SEQUENCE LISTING . 

1 ? I * ■ 

<110> Dougherty, Thomas J. 
( t Pucci, Michael J. i ( . ' ■ ' 

5 ' Dougherty, Brian A. ■ ( 

Davison, Daniel B. ' * , ' 

Bruccoleri, Robert E; ' ' . . 

Thanassi, Jane A, 

10 <120> NOVEL BACTERIAL GENES AND PROTEINS THAT ARE ESSENTIAL , 

FOR CELL VIABILITY AND THEIR USES 

' <130> 30436. 44USU1 

15 <1,40> Not yet known 
<141> 2000-12-30 

<160> 226 ! 

i 

20 <170> Patentln Ver. 2.0 

<2io> i • , 

<211> 708 . 
<2'12> DNA I 

25 <213> Streptococcus pneumoniae | | 

i 

<400> 1 , 

atgatttatg caggaattct tgccggtgga actggcacac gcatgggaat cagtaacttg 60 

ccaaaacaat .ttttagagct aggtgatcga cctattttga ttcatacaat tgaaaaattt 120 

30 gtcttggaac caagtattga aaaaattgta gttggggttc atggagactg gg.ttttacat 180 

gcagaagatc ttgtagataa atatcttcct cttcataagg aacgtattat cattacaaag 24 0 

ggtggtgctg accgcaatac aagtattgag aacatcattg aagccattga tgcttatcgc 300 

ccgcttactc cagaggatat cgttgttacc cacgattctg ttcgtccatt tattacgctt 360 

cgcatgattc aagacagtat caaacttgct caaaatcatg acgcagtgga tacagtagta 420 

35 gaagcagtgg atactatcgt tgaaagtacc aatggtcaat tcattacagg tattccaaat 480 

cgtgctcacc tctatcaggg acaaacacct caaacattcc gttgcaagga cttcatggac 540 

ctttatggat ctctttctga tgaagagaag gaaatcttga cagatgcatg taaaatcttt 600 

gtgatcaaag gaaaagatgt agccttggcc aaaggcgaat actcaaatct gaagattaca 660 

accgtaacag atttgaagat tgcaaaaagt atgattgaga aagactag 708 



<210> 2 
<211> 558 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 2 

atggctaacg taattattga aaaagctaaa gagagaatga cccagtctca ccaatcactt 60 

gctcgtgaat ttggtggtat ccgtgctggt cgtgccaatg caagcttgct tgaccgtgta 120 

catgtagaat actatggagt cgaaactcct cttaaccaaa tcgcttcaat tacgattcca 180 

50 gaagcgcgtg ttttgttggt aacaccattt gacaagtctt cattgaaaga catcgaacgt 240 

gccttgaacg cttctgatct tggtatcaca ccggctaatg acggttctgt gattcgcttg 300 

gttatcccag ctcttacaga agaaactcgt cgtgaccttg ctaaagaagt gaagaaggtc 360 

ggcgaaaatg ctaaagtggc tgtccgcaat atccgtcgcg atgctatgga cgaagctaag 420 

aaacaagaaa aagcacaaga aatcactgaa gacgaattga agactcttga aaaagatatt 4 80 

55 caaaaagtaa cagacgatgc tgttaaacac atcgacgaca tgactgctaa caaagagaaa 540 

gaacttttgg aagtctaa * 558 
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<210> 3 , 
<211> 1353 i . . 

<212> DNA 

i ■ 

<213> Streptococcus pneumoniae ' i, 

5 • , ■ ■ , - '■ ' ' 

<400> 3 (( . , • • • i . , 

atgggtaaat attttgggac tgatggagtc cgtggagaag 1 ctaacctaga actaacacca 60 

gaattagcct ttaaactagg acgttttgga ggctatgttc ttagtcaaca tgaaacggaa 120 

gcgccgaaag tctttgtagg acgtgacaca cgtatttcag gggaaatgct ggaatcggcc 180 

10 ttggtggcag gtctcctttc agtagggatt cacgtataca< aacttggtgt ccttgcaaca 240 

ccagcagtag cttacttggt tgaaactgaa ggagcaagtg. ccggtgtcat gatttctgct 300 

agccacaacc cagcccttga taacggaatc aagttctttg gcggtga!tgg ctitcaaacta 360 

gatgatgaaa aagaagcaga aattgaagcc ,ttcjctagatg ctgaggaaga cactcttcct 420 

cgtcc&agtg cagaaggctt aggaattttg gtagattatc cagaaggctt gcgtaagtat 480 

15 gaaggatacc gtgtgtcaac tggaactcct cttgatggaa ( tgaaggttgc cttggataca 54 0 

gctaatggag cagcttctac cagtgcccgt caaatctttg cagacctttfg tgcccaattg 600 

acggttatcg gggaaacacc agacggtctt aacatcaacc ttaatgttgg ttcaacacat 660 

ccagaagccc ttcaagaagt ggtcaaagaa agtgggtcag ctattggttt ggcctttgat 720 

ggagacagtg accgcttgat tgctgttgat gagaatggtg acatcgtcga tggtgacaag 780 

20 attatgtaca tcatcggaaa atacctttct gaaaaagg^c aattggctca aaatacaatt 840 

gtgacaactg ttatgt.ctaa ccttggtttc cacaaggcct tgaatcgcga aggtattaac 900 

aaggcagtta ictgcagttgg tgaccgctac gttgttgaag aaatgagdaa atcaggctac 960 

aaccttggtg gtgaacagtc tggtcacgtt atcttgatgg attacaatac cacaggtgat 1020 

ggtcaattat cagcagttca attgactaaa atcatgaagg aaactggtaa gagcjbtatca 1080' 

25 gagttggcgg cagaagtaac gatttatcca caaaaattag ttaatatccg agtggaaaac 1140 

gtcatgaagg aaaaggccat ggaagtgcca gctatcaagg ccatcatcga gaagatggaa 1200 

gaagaaatgg cggggaacgg ccgtatcctt gttcgtccaa gtggaacaga gcccctcttg 1260 

cgtgttatgg cagaagcgcc tacaacagaa gaagtggact actatgttga taccatcaca 1320 

gatgtagttc gtgctgaaat tggga^tgac taa 1353 



30 



35 



<210> 4 
<211> 705 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 4 

atgaaaaaaa tactaattgt agatgatgag aaaccaatct cggatattat caagtttaat 60 
atgaccaagg aaggttatga agttgtaact gcttttaatg gtcgtgaagc gctagagcaa 120 
tttgaagcag agcaaccaga tattattatt ctggatttga tgcttccaga aattgatggt 180 

40 ttagaagttg ctaagaccat tcgtaagaca agcagtgtgc ccattcttat gctttcagcc 24 0 
aaagatagtg aatttgataa ggttatcggt ttggaacttg gggcagatga ctatgtaacg 300 
aaacccttct ccaatcgtga gttgcaggcg cgtgttaaag ctcttctgcg tcgttctcaa 360 
cctatgccag tagatggtca ggaagcagat agtaaacctc aacctatcca aattggggat 420 
ttagaaattg ttccagacgc ctacgtggct aaaaaatatg gcgaagaact agacttaacc 480 

45 catcgtgaat ttgagctttt gtatcattta gcatcgcata caggtcaagt catcacgcgc 540 
gaacacttgc ttgagactgt ctggggttat gactattttg gtgatgtccg cacagttgat 600 
gtgactgtac gacgtctgcg tgagaagatt gaagatacgc ccagccgacc agagtatatc 660 
ttgacgcgcc gtggtgtagg gtattacatg agaaataatg cttga 705 

50 <210> 5 

<211> 1107 
<212> DNA 

<213> Streptococcus pneumoniae 
55 <:4J30> 5 

atggaagaaa ttctctgtat tggttgtgga gcaaccattc agacgacaga taaagctggt 60 
cttggtttta ccccccagtc ggcacttgaa aaaggtttgg agactggcga agtctattgc 120 
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55 



c'aacgctgtt , tccgtctccg ccactacaat gaaatcacag atgtpcagtt gacgaacgat 180 

gatttcctca agctcttgca cgaggtggga gacagtgatg ctttagtggt caatgtcatt 240 

gatatctttg attttaatgg atctgtcatc ccaggtttac cacgtttcgt ctcgggcaat 300 

gatgtcctct tggtaggaaa taaaaaagat atccttccta . agtcagttaa ' gtctggtaag 360 

5 attagccagt ggctcatgaa acgtgcccat gaagaaggtc ttcgtccagt cgatgtggt;c 420 1 

ctaacttcag cacaaaataa acatgccatt aaggaagtca 'ttgacaagat tgaacactac ,480 

cgtaagggcc gcgatgtcta tgtggtcggt gtgaccaacg ttggaaaatc aactctaatc 540 

aatgctatta tccaagaaat cacgggtgat cagaatgtca tcactacttc acgcttccca 600 

gggacaacct tggacaaaat agagattccg cttgacgacg gatcttatat ttacgatacg 660 

10 ccgggaatta tccaccgcca ccagatggct cactacttga cggccaaaaa cctcaagtat 720 

gtcagtccta aaaaggaaat caagcctaag acctatcagc t.taatcctga gcaaacccta 780 

tttttaggtg gtttgggacg ctttgacttt atagcaggag aaaagcaagg atttactgct 840 

* ttctttgata atgaactcaa actccatcgt agcaagcttg aaggagctag tgctttctac 900 

gataagcacc tgggaactct tctgacacca ccaaatagca aggaaaaaga agatttccca 960 

15 aggctagtcc agcatgtctt taccattaa]a gataagacag acctagtcat ctcaggccta 1020 

ggatggattc gtgtaacagg cacagcaaaa gtcgccgtct gggcaccaga aggcgtcgcc 1080 

gtcgtcacac gaaaagcaat tatttaa i i 1 1107 



<210> 6 
20 <211> 1461 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 6 1 

25 atgtatccag atgatagttt gacattgcab acggacttgt accagatcaa catgatgcag <p0 
gtttactttg accaagggat tcacaataag aaggcggtct ttgaggtgta tttccgccaa 120 
cagcctttta agaacggcta tgcggttttt gcaggtttgg aaagaattgt gaactatctt 180 
gaagacttgc gtttttcaga tagtgatata gcctatttgg agtcgcttgg ttatcatggg 240 
gcgttcttgg attaccttcg caatttcaag ttggagttga ccgttcgttc tgcccaagaa 300 
30 ggggatttgg tttttgctaa tgaaccgatt gtgcaggtgg aaggacctct agcccaatgt 360 
cagttggtcg aaacggctct tttgaacatc gtcaactacc agaccttggt ggcgacgaag 420 
gcagctcgta ttcgttcggt tatcgaagat gaacccttga tggagtttgg gacacgtcgg 4 80 
gctcaagaaa cggatgcggc catctgggga acacgcgcag cggtgattgg tggcgccaat 54 0 
ggaaccagca acgtgcgtgc gggtaagctc tttgacattc ctgttttggg aacccatgcc 600 
35 catgccttgg tacaggttta tggcaatgac tatgaggctt tcaaggctta cgctgcgacc 660 
cacaaaaatt gtgtctttct tgtggatacc tatgataccc ttcgcatcgg tgtaccagct 720 
gccattcagg tggcgcgtga gctgggtgat cagattaact ttatgggtgt gcggattgac 780 
tctggggata ttgcctacat ttctaagaaa gtccgtcagc aactggacga ggctggattt 840 
acagaggcta agatttatgc ttctaatgat ttggacgaaa atactatcct caatctcaag 900 : 
40 atgcaaaagg ccaagattga tgtctggggt gtgggtacca agctgattac agcctatgac 960 
cagccggctc ttggggcggt ttacaagatt gttgcaatcg aagatgaaac tggtcagatg 1020 
cgcaatacga ttaagctgtc taataatgcg gaaaaagtgt cgacgccagg taagaagcag 1080 
gtgtggcgca ttaccagtcg tgaaaaaggt aagtcagaag gtgattacat cacttatgat 114 0 
ggtgtggata ttagcgacat gacagaaatc aagatgttcc atccgaccta tacatacatc 1200 
45 aagaagacgg ttcgtaattt tgatgccgtt cctctcttgg tggatatctt caaagaagga 1260 
atattagttt acaacttgcc tagtttgact gacattcagg attatgcccg taaagaattt 1320 
gacaagttgt gggatgagta taagcgtgtg ctcaatccgc agcactatcc agtggatttg 1380 
gcgcgtgatg tatggcaaga taagatggac ttgattgata agatgcgcaa ggaagccctt 1440 
ggtgaaggag aagaagaatg a .14 61 



<210> 7 
<211> 852 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 7 

atggctacta ttcaatggtt tcctggtcac atgtctaaag ctcgtcgaca ggtgcaggag 60 
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20 



aatttaaaat ttgttgattt tgtgacgatt ttagtagatg cacgcttgcc tctatctagt 120 

caaaatccta tgttgaccaa gattgttggt gataaaccaa aactcttgat ttjbaaacaag 180- 

gccgacttgg ctgatccagc aatgaccaag gaatggcgtc agtattttga atcacaagga 240. 

atccagacgc tagctatjcaa ctccaaagag caagtgactg taaaagttgt aacagatgcg 300 

5 gccaagaagc tcatggct^a taagattgct ccjccagaaag aacgtgggat; tcagattgaa 360 

accttgcgta ctatgatiat cgggattcca aacgctggta clatcaactct gatgaaccgt 42Q 

ttggctggta * aaaagattgc tgttgttgga aacaagccag ' gggtcacaaa aggtcaacaa' 480 

tggcttaaaa ccaataaa^a cctggaaatc ttggatacac cggggattct ctggcctaag 540 

tttgaggatg aaactgttgc acttaagttg gcattgactg gagctatcaa ggatcagttg 600 

10 cttcctatgg atgaggttac catttttggt atcaattatt tcaaagaaca ttatccagaa 660 

aagctggctg aacgcttcaa acaaatgaaa attgaagaagf aaccgtctgt gattattatg , 720 

gatatgaccc gcgccctcgg , tttccgtgat gactatgacc gtttttacag tctcttcgtg 780 

aaggaagttc gtgatggcaa actcggtaac tataccttag atacattgga agacctcgat 840 

ggcaa,cgatt ,aa 1,1 852 

<210> 8 ' 1 , 

<211> 471. ' " ( 

<212> DNA ' ' 

<213> Streptococcus pneumoniae * i ■ 

<400> 8 1 

atgattaaca, atgttgtact tgtagggcgt atgacacgtg acgctgagtt gcgttatacc 60 
ccatcaaatg tagcagt^gc gacttttact cttgcagtaa accgtacatt taagagtcaa 120 
aatggtgaac gtgaggctga ttttatcaat gtcgttatgt ggcgccaaca ggctgaaaat 180 
25 cttgctaact gggctaaaaa aggctcactt atcggggtga caggtcgtat ccagactcgt * 240 
agttacgata accagcaagg acaacgtgtc tacgtgacag aggtcgtggc tgagaatttc 300 
caaatgttgg aaagccgtag tgtgcgtgag ggccacacag gtggagctta ctctgcacca 360 
actgcaaact attcagcacc tacaaattca gtaccagact tttcacgtaa tgaaaatcca 420 
tttggagcaa caaacccatt ggatatttca gatgatgatt taccattcta a 471 

<210> 9 
<211> 975 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 9 

atgaaaacgc gtattacaga attattgaag attgattatc ctattttcca aggagggatg 60 
gcctgggttg ctgatggtga tttggcaggg gctgtttcca aggctggagg attaggaatt 120 
atcggtgggg gaaatgcccc gaaagaagtt gtcaaggcca atattgataa aatcaaatca 180 

40 ttgactgata aaccctttgg ggtcaacatc atgctcttat ctccctttgt ggaagatatc 240 
gtggatctcg ttattgaaga aggtgttaaa gttgtcacaa caggagcagg aaatccaagc 300 
aagtatatgg aacgtttcca tgaagctggg ataatcgtta ttcctgttgt tcctagtgtc 360 
gctttagcta aacgcatgga aaaaatcggt gcagacgctg ttattgcaga aggaatggaa 4 20 
gctggggggc atatcggtaa attaacaacc atgaccttgg tgcgacaggt agccacagct 4 80 

45 atatctattc ctgttattgc tgcaggagga attgcggatg gtgaaggtgc tgcggctggc 540 
tttatgctag gtgcagaggc tgtacaggtg gggacacggt ttgtagttgc aaaagagtcg 600 
aatgcccatc caaactacaa ggagaaaatt ttaaaagcaa gggatattga cactacgatt 660 
tcagctcagc actttggtca tgctgttcgt gctattaaaa atcagttgac tagagatttt 720 
gaactggctg aaaaagatgc ctttaagcag gaagatcctg atttagaaat ctttgaacaa 7 80 

50 atgggagcag gtgccctagc caaagcagtt gttcacggtg atgtggaggg tggctctgtc 840 
atggcaggtc aaatcgcagg gcttgtttct aaagaagaaa cagctgaaga* aatcctaaaa 900 
gatttgtatt acggagccgc taagaaaatt caagaagaag cctctcgctg gacaggagtt 960 
gtaagaaatg actaa 975 

55 <2io> 10 

<211> 423 
<212> DNA 
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<213> Streptococcus pneumoniae * ■ 

<400> 10 ' ■'■*•,.• 

atgatcgata ttcaagg'aa l t caaagaagcc cttccccacc gttatcctat gcttctagtg 60 

5 gaccgtgtct tggaggtgag cgaggatacc attgttgcta t^aaaaatgt 1 gaccatcaac 120 

gagcctttct ttaacggcca ctttcctcaa tacccagtta .tgccaggtgt tctgattatg . 180 

gaagccttgg cgcaaactgc cggtgtgttg gagttatcaa' aacctgaaaa taaaggaaaa 240 

ctggtctttt acgctggtat ggataaggtt aagttcaaga agcaagttgt accaggcgac 300 

caattggtta tgacagcgac ttttgtaaaa cgtcgtggca ccatagctgt ggttgaagca 360 

10 aaggctgaag tggatggcaa gcttgcagcc agtggtaccc 1 ttacttttgc aattgggaac 420 
taa 1 ' 1 , '423 

<210> 11 \ •. , 

<211>' 1023 ' ' ■ • , . ' 

15 <212> DNA 1 , 

<213> Streptococcus pneumoniae ' • 

<400> 11 . ' ■ 

atgattaatc aaatttatca actaactaaa cctaagttta' tcaatgtcaa atatcaggaa 60 

20 gaggctattg accaagagaa , tcatatcctt atccgtccca actacatggc tgtctgtcat 120 
gcggatcagc gttactacca gggaaaacgt gatcccaaga ttttgaataa aaagcttcca 180 
atggcaatga 1 ttcacgagtc atgtggaatc gtcatttctg aoccgagcgg aacctacgag 240 
gttggtcaaa aagttgtcat gattcccaat cagtctccta tgcagagtga tgaagaattc 300 
tatgaaaact acatgacagg gapccatttc ttgtctagtg gatttgatgg ctttatgaga 360 

25 gagtttgttt ctctccctaa agatcgtgtg gtggcttatg atgctattga agatacggtt 420 
gcagccatta cagagtttgt cagtgtgggc atgcacgcta tgaatcgtct attgactctt 4 80 
gctcatagca agcgggagcg gatccccgtt attggagatg gaagtttagc ttttgtggtt 540 1 
gccaatatta tcaactatac tttgccagaa gcagagattg tggttattgg tcgtcattgg 600 
gaaaagttgg aactcttctc atttgccaaa gaatgctata ttacggataa tattcctgaa 660 

30 gagttggcct ttgaccatgc ttttgaatgt tgtggtggtg atggtactgg accagctatt 720 
aatgacttga ttcgctacat tcgtcctcag ggaacaattc tcatgatggg agttagcgaa 780 
tataaagtca atctcaatac tcgcgatgcc ttagaaaagg gcttgctctt ggttgggtca 840 
tctcgttctg gtcgcattga ttttgaaaat gctatccaaa tgatgaaagt caagaaattt 900 
gccaatcgtc ttaaaaatat cctttatcta gaagaacctg taagagaaat taaagatatt 960 • 

35 caccgtgtct ttgcaaccga tttaaacaca gcctttaaaa cagtgtttaa gtgggaagta 1020 
taa 1023 

<210> 12 
<211> 1344 
40 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 12 

atgaacttaa aaactacttt gggccttctt gctggacgtt cttcccactt cgttttaagc 60 
45 cgtcttggac gtggaagtac gctcccaggg aaagtcgccc ttcaatttga taaagatatt 120 
ttacaaaacc tagctaagaa ctacgagatt gtcgttgtca ctggaacaaa tggaaaaacc 180 
ctgacaactg ccctcactgt cggcatttta aaagaggtct atggtcaagt tctaaccaat 240 
ccaagcggtg ccaacatgat tacagggatt gcaacaacct tcctaacagc caaatcttct 300 
aaaactggga aaaatattgc cgtcctcgaa attgacgaag ccagtctatc tcgtatctgt 360 
50 gactatatcc agcctagtct ttttgtcatt actaatatct tccgtgacca gatggaccgt 420 
ttcggtgaaa tctatactac ctataacatg atattggatg ccattcggaa agttccaact 480 
gctactgttc tccttaacgg agacagtcca cttttctaca agccaactat tccaaaccct 540 
atagagtatt ttggttttga cttggaaaaa ggaccagccc aactggctca ctacaatacc 600 
gaagggattc tctgtcctga ctgccaaggc atcctcaaat atgagcataa tacctatgca 660 
55 aacttgggtg cctatatctg tgagggttgt ggatgtaaac gtcctgatct cgactatcgt 720 
ttgacaaaac tggttgagtt gaccaacaat cgctctcgct ttgtcataga cggccaagaa 780 
tacggtatcc aaatcggcgg gctctataat atctataacg ccctagctgc tgtggccatc 840 
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gcccgtttcc tcggcgcaga ttcccaact'c atcaaacagg gatttgacaa gagccgtgct 900 
gtctttggac gccaagaaac ctttcatatc ggtgacaagg aatgtaccct tgtcttgatt 960 
aaaaatccag tcggtgcaac ccaagctatc gaaatgatca aactagcacc ttatccattt 1020 
agcctatctg tcctccttaa tgccaactat gcagatggaa ttgacactag ctggatct.gg 1080 
gatgcagact ttgagcaaat cactgacatg gacattcctg aaatcaacgc tggcggtgtt' 1140 
cgtcattctg aaatcgctcg tcgcctccga gtgactggct .atccagctga gaaaatcact 1200 
gaaacgagta atctggagca agttctcaag accattgaga atcaagactg caagc&tgcc 1260 
tatattctgg caacttatac tgccatgctg gaatttcgtg aactgctggc tagtcgtcag 1320 
attgttagaa aggagatgaa ctaa 134 4 



<210> 13 
<211> 783 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 13 

atggtttata cttcactttc ctcaaaagat ggcaattacc cctatcagct caacattgqc , 60 

.cacctctacg gaaatctcat gaatacctac ggggacaatg gaaacatcct catgctcaag 120 

tatgtggctg aaaaactggg agcccatgtg accgttgaca tcgtttctct ccatgatgac 180 

tttgatgaaa atcactacga catcgccttt ttcggtggtg gtcaagactt tgaacaaagt 240 

atcattgcag acgacctacc tgctaaaaaa ( gagagGattg acaactacat ccaaaacgac 300 

ggtgtagttc tggctatctg cggtgcftttc caactattgg gtcaatatta tgttgaagct 360 

tcaggaaaac gtatcgaagg gctaggggtc atgggacact acacgctcaa ccagaccaat 4 20 

aaccgtttta tcggtgacat caagattcac aatgaagatt tcgatgaaac ctactatgga 4 80 

tttgaaaatc accaaggtcg taccttcctp tctgatgacc aaaaaccgct gggacaggtt 540 

gtctatggaa atggaaacaa cgaagaaaag gtcggtgaag gggttcatta taagaatgtc 600 

tttggttcct acttccacgg gcctatcctc tctcgtaatg ccaatctggc ttatcgccta 660 

gttactactg ccctcaagaa gaaatatggt caggacatcc aactccctgc ctatgaggac 720 

attctcagcc aagaaatcgc tgaagagtac agtgacgtca aaagcaaggc tgacttttct 780 

taa 783 



<210> 14 
<211> 276 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 14 

atggcaaaca aacaagattt gatcgctaaa gtagcagaag ctacagaatt aactaagaaa 60 
gactcagcag* cagcagttga agctgtattt gcagcagtag ctgactatct tgcagctggt 120 
gaaaaagttc agttgatcgg ttttagtaac tttgaagttc gtgagcgcgc agaacgtaaa 180 
ggtcgcaacc cacaaactgg taaagaaatg acaattgcag cttctaaagt accagcattc 240 
aaagctggta aagctcttaa agacgctgtt aaataa 276 

<210> 15 
<211> 840 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 15 

atgggaattg ctctagaaaa tgtgaatttt acatatcaag aaggtactcc cttagcttca 60 
gcagctttgt cggatgtttc tttgacgatt gaagatggct cttatacagc tttaattggg 120 
cacacaggta gtggtaaatc aactatttta ca.actcttaa atggtttatt ggtgccaagt 180 
caagggagtg tgagggtttt tgatacctta atcacctcga cttctaaaaa taaagatatt 24 0 
cgtcaaatta gaaaacaggt tggcttggta tttcagtttg ctgaaaatca gatttttgaa 300 
gaaacggttt tgaaggacgt tgcttttgga ccgcaaaatt ttggagtttc tgaagaagat 360 
gctgtgaaga ctgcgcgtga gaaactggct ctggttggaa ttgatgaatc actttttgat 420 
cgtagtccgt ttgagctgtc agggggacaa atgagacgtg ttgccattgc aggcatactt 480 
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35 



gccatggagc cagctatatt agtcttacjat gagccaacag ctggtctaga tcctctagcfg 540 

agaaaagagt tgatgaccct gttcaaaaaa ctccaccagt cagggatgac catcgtcttg 600 

gtaacgcatt tgatggatga tgttgctgaa tatgcgaatc aagtctatgt aatggaaaag 660 

, ggacgtttag taaagggggg caaaccaagt gatgtctttc aagacgttgt ' ttttatggaa, 720 ( 

5 gaagttcagt tgggagtacc taaaattacg gccttttgta aacgattggc tgatagaggc 780 

gtgtcattta aacgattacc ggttaagata gaggagttca aggagtcgct aaatggatag 840 

- > ' 1 

<210> 16 

<211> 930 , ! 
10 <212> DNA , ■ ■ , 

<213> Streptococcus pneumoniae ' , 

* 

1 <400> 16 

atggatattc aatttttagg aacgggggct ggtcagccct ctaaagcccg caacgtttca 60 

15 agtctcgccc tgaaactttt ggatgagatjb aacgaagttt ggctctttga ctgtggagaa 120 

ggtacgcaaa atcgcattct ggaaaccaca attcgaccac gtaaggtcag caaaatcttt 180 
attacccatc tgcatggaga ccacattttt ggtttgccag gtttcctttc tagccgtgdc' ' 240 

tttcaggcca atgaagagca gacagatttg gaaatctacg gacctcaagg aatcaagtcal 300 

tttgtcttaa ccagccttcg tgtgtcaggt tctcgtctgc cctaccgcat ' tcatttccat 360 

20 gagtttgacc aagattctct aggtaaaatt cttgaaatcg ataaattcac tgtgtatgca 420 

gaggagctgg accacactat tttctgtgtt ggctatcgtg tcatgcaaaa ggatctagaa 4 5.0 

gggacgctgg atgctgaaaa actcaaggct gctggtgttc cgttcggccc gctttttggt 540 

aaaatcaaaa atggccagga tcttgttttg gaagacggaa ctgaaatcaa ggcagcagac 600 

tatatctcaig cgccacgtcc aggtaagat^ atcactattt taggagacac tcgaaaaacg 660 

25 gatgccagtg tgcgtctggc tgtcaatgci gatgtcctag ttcatgagtc cacttatggc 1^20 

aagggtgatg aaaaaattgc tcgtaaccat ggtcactcaa ctaatatgca agctgcacaa 780 

gtagcggtag aagcaggtgc caaacgcctc ctactcaacc atatcagtgc ccgtttcctc 840 

tcaaaagata tt'agcaaact caagaaagac gctgccacaa tttttgaaaa tgtccatgtg 900 

gtcaaagact tggaagaagt ggaaatctag 930 

<210> 17 
<211> 1662 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 17 

atgagtaata tcagtttaac aacacttggt ggtgtgcgtg agaatggaaa aaatatgtac 60 

attgctgaaa ttggagagtc catttttgtt ttgaatgtag ggt-taaaata tcctgaaaat 120 

gaacaattag gggtcgatgt ggtgattcca aacatggatt acctttttga aaatagcgac 180 

40 cgtattgctg gggttttctt gacccacggg catgcggatg cgattggtgc tctaccttat 240 

ctcttggcag aggctaaagt tcctgtattt gggtctgagt tgaccattga gttggcaaag 300 

ctctttgtca aaggaaatga tgccgttaag aaatttaatg atttccatgt cattgatgag 360 

aatacggaga ttgattttgg tgggacagtg gtttccttct tccctacgac ttactccgtt 420 

ccagagagtc tgggaattgt cttgaagaca tcggaaggaa gcatcgttta tacaggtgac 480 

.45 ttcaaa'tttg accaaacggc tagtgaatct tatgcaactg attttgctcg tttggcagag 54 0 

attggtcgtg acggcgtcct ggctctcctc agtgattcgg ccaatgcaga cagcaatatt 600 

caggtggcta gtgaaagtga agttagggat gaaattaccc aaactattgc tgactgggaa 660 

ggtcgtatca tcgttgcagc tgtttccagt aatctttctc gtattcagca gatttttgac 720 

gctgcggata aaacaggtcg acgtatcgtc ttgacaggat ttgatattga aaatatcgtc 780 

50 cgcacagcga ttcgtcttaa gaagttgtct ttagccaacg aaattctctt gattaagcct 840 

aaagatatgt ctcgctttga agaccatgag ttgattattc ttgagacagg tcgtatgggt 900 

gaacctatca atggacttcg taagatgtcg attggtcgcc atcgttatgt agaaatcaag 960 

gatggggacc tggtctatat tgctacggct ccgtctattg ctaaagaagc ctttgttgcg 1020 

cgtgtagaaa atatgattta tcaggcaggt ggggttgtca aattgattac" ccaaagttta 1080 

55 catgtatcag ggcacggaaa tgtgcgtgat ttgcagctga tgatcaatct tttgcaacct li40 

aagtacctct tccctgtcca aggggagtat cgtgagttgg atgctcacgc taaggctgcc 1200 

atggcagttg ggatgttgcc agaacgcatc ttcattccta aaaaggggac gaccatggct 1260 
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tacgagaatg gagactttgt tccagctgga tcggtttcag caggagatat cttaattgat* 1320 

gggaatgcca ttggtgatgt tggaaatgtt gttcttcgtg accgtaaggt cttgtcagag 1380 

gatggaattt tcatcg'tggc tattacagtc aaccgtcgtg agaagaaaat tgtggctaga 14 40 

gctcgtgttc acacgcgltgp atttgtttat cticaagaaga gtcgcgatat tctccgtgaa 1500 

agttcagaat tgattaacca aacggtagaa gattatcttc a,acjgagatga' ctttgactgg 1560 

gcagatctta aagggaaggt tcgagataat ttgaccaagt ,atctctttga ccaaaccaag ( 1620 

cgtcgtccag ctattttacc agtagtcatg gaagcaaaat 1 aa ' . 1662 

<210> 18 

<211> 951 i ; ' ' 

<212> DNA * ' ' : " ( - 
<213> Streptococcus pneumoniae 

f , i ■ ■ 

<400>' 18 ' ' ■ . , 

atgacaaaag aatttcatca tgtaacggtc ttactccacg .aaacgattga tatgcttgac 60 

gtaaaacctg acggtatcta cgttgatgcg actttgggtg gagcaggcca tagcgagtat 120 

ttattaagta aattaagtga aaaaggccat ctctatgcct ttgaccagga tcagaatgcc 180 

attgacaatg cgcaaaaacg cttggcacct tacattgaga' agggagtggt gacctttatc 24 0 

aaggataact tccgtcattt acaggcacgt ttgdgcgaag ctggtgttca ggaaattgat 300 

ggaatttgtt atgacttggg ( agtgtctagt cctcaattgg accagcgtga gcgtggtttt 360 

tcttataaaa aggatgcgcc actggacatg cggatgaatc aggatgctag tctgacagcc 420 

tatgaagtgg tttaatcatta tgactatcat gatttggttc gtattttctt caaatacggt 480 

gaggataaat tctctaaaca gattgcgcgt aagattgagc aagcgcgtga agtgaagccg 540 

attgagacaa cgactgagtt agcagagatt atcaagttgg tcaaacctgc caaggaactc 600 

aagaagaagg gtcatcctgc taagcagatt ttccaggcta ttcgaattga agtcaatgat 660 

gaactggggg cggcagatga gtccatccag caggctatgg atatgttggc tctggatggt 720 

agaatttcag tgattacctt tcattcctta gaagadcgct tgaccaagca attgttcaag 780 i 

gaagcttcaa cagttgaagt tccaaaaggc ttgcctttca tcccagatga tctcaagccc 84 0 

aagatggaat tggtgtcccg taagccaatc ttgccaagtg cggaagagtt agaagccaat S00 

aaccgctcgc actcagccaa gttgcgcgtg gtcagaaaaa ttcacaagta a 951 

<210> 19 
<211> 999 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 19 

atgagtagaa ttttagataa tgagataatg ggggatgagg agttagtaga acgcacgctc 60 

cgtcctcagt atttacgtga atatatcgga caggataagg tcaaggacca gctacaaatc 120 

tttattgaag ctgccaaaat gcgggatgaa gcgctggatc atgtgctctt atttgggcct 180. 

ccaggtctcg ggaaaacgac catggccttt gttattgcca acgaactggg agtcaatctt 240 

aagcagacgt cgggtccagt cattgaaaaa gccggagatc tggtagctat tttgaatgag 300 

ttagagcctg gggatgtact ttttattgat gagatccatc gtttgccaat gtcagtggaa 360 

gaggtgcttt atagtgctat ggaggacttc tacatcgata ttatgattgg ggctggtgag 420 

ggtagtcgta gtgttcattt ggagttacca ccttttacct tgattggtgc gacgactcgg 480 

gctggtatgc tctccaatcc gctacgggca cgttttggga ttacaggcca tatggagtat 540 

tatgcccatg ctgacttgac agaaattgtc gagcggacgg cagatatttt tgagatggaa 600 

atcactcatg aggcagcatc tgagttggcc ctacgtagtc gtgggacccc tcgtattgcc 660 

aatcgtctcc tcaagcgcgt gcgcgatttt gcccagataa tggggaatgg ggtaattgat 720 

gatattatta ccgataaggc tttgactatg ctggatgttg accatgaagg tttggactat 780 

gtggatcaaa aaatccttcg taccatgatt gagatgtaca gtggaggacc tgttggtcta 840 

ggaactcttt ctgtgaatat cgccgaagag cgtgagacag ttgaa'gacat gtatgagcct 900 

tacttgattc aaaaaggttt tatcatgcgg acacggtctg gacgggtggc' gactgctaag 960 

gcatatgagc acttaggtta tgaatacagt gaaaaataa 999 

<210> 20 
<211> 1311 
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<212> DNA ' , ' 

<213> Streptococcus pneumoniae 1 

<400> 20 ' i ' 1 ■ 

5 atgagtatgt ttttagatac agctaagatt aaggtcaagg ctggtaatgg tggcgatggt 60 

atggttgcct ttcgtcgtga aaaatatgtc cctaatggag gcccttgggg* tgjgtcfatggt '120' 

ggtcgtggag gcaatgtggt cttcgttgta gacgaaggac ' tacgtacctt gatggatttc 180 

cgctacaatc gccatttcaa ggctgattct ggtgaaaaag ggatgaccaa agggatgcat 240 

ggtcgtggtg ctgaggacct tagagttcga gtatcacaag gtacgactgt tcgtgatgcg 300 

10 gagactggca aggttttaac agatttgatt aaacatgggc ' aagaatttat cgttgcccac 360 

ggtggtcgtg gtggacgtgg aaatattcgt tttgcgacac: caaaaaafcc' tfcjcaccggaa 420 

atctctgaaa atggagaacc aggtcaggaa cgtgagttac aattggaact aaaaatcttg 480 

gcagatgtcg gtttagtagg attcccatct f gtagggaagt caacactttt aagtgttatt 540 

acctcagcta agcctaaaat< tggtgcctac cactttacca ctattgtacc aaatttaggt 600 

15 atggttcgca' cccaatqagg tgaatccttt gcagtagccg acttgccagg tttgattgaa 660 

ggggctagtc aaggtgttgg tttgggaact cagttcctcc gtcacatcga gcgtacacgt 720 

gttatccttc acatcattga tatgtcagct agcgaaggcc , gtgatccata tgaggattac 78*0 

ctagctatca ataaagagct ggagtcttac aatcttcgcc tcatggagcg tccacagatt 840 

attgtagcta ataagatgga catgcctgag acftcaggaaa atcttgaaga atttaagaaa 900 

20 aaattggctg aaaattatga tgaatttgaa gagttaccag ctatcttccc aatttctgga 960 

ttgaccaagc aaggtctggc aacactttta gatgctacag ctgaattgtt agacaagaca 1020 

ccagaatttt 'tgctctacga cgagtccgat atggaagaag aagcttacta tggatttgac 1080 

gaagaagaaa aagcctttga aattagtcgt gatgacgatg cgacatgggt actttctggt 1140 , 

- gaaaaactca tgaaactctt taatatgacc aactttgatc gtgatgaatc tgtcatgaaa 1200 

25 tttgcccgtc agcttcgtgg tatgggggtt gatgaagccc ttcgtgcgcg tggagctaaa 1260 

gatggggatt tggtccgcat tggtaaattt gagtttgaat ttgtagacta g 1311 

i 

<210> 21 
<211> 519 
30 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 21 

atgaactact ttaatgttgg gaaaatcgtt aatacgcagg gattacaggg tgagatgcga 60 

35 - gtcttgtctg tgacggattt tgcagaagaa cggtttaaaa aaggagctga gctggctttg 120 

tttgatgaaa aagatcagtt tgtccaaaca gtgaccatcg ctagccaccg taaacagaag 180 

aactttgaca ttattaaatt caaagatatg taccatatca atactatcga aaagtacaag 240 

ggatacagtc tcaaggtcgc tgaggaagat ttgaatgacc tagaccjatgg tgaattttac 300 

tatcacgaga ttatcggttt ggaagtctat gagggtgata gcttggttgg aaccatcaag 360 

40 gaaatcctgc aaccaggtgc taatgatgtc tgggtggtca aacgaaaagg caaacgtgat 4 20 

ttgcttttac cttatatccc accagtggtt ctcaatgttg atattccaaa taaacgggtc 4 80 

gatgtggaaa tcttagaagg gttagacgat gaagattga 519 

<210> 22 
45 <211> 720 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 22 

50 atgaagattg atattttaac cctctttcca gagatgtttt ctccactgga gcactcaatc 60 
gttggaaagg ctcgagaaaa agggctcttg gatatccagt atcataattt tcgagaaaat 120 
gctgaaaagg cccgtcatgt agatgatgag ccctacggag gcggtcaggg catgttgctc 180 
agagcacaac ctattttcga ttcctttgat gctattgaaa agaaaaatcc gcgcgttatt 240 
ctcctcgatc ctgctggaaa gcagtttgat caggcttatg ctgaagattt ggctcaagag 300 . 

55 gaagagctaa tctttatctg tgggcactat gagggttatg atgagcgcat taagaccttg 360 
gtaacagatg agatttccct aggcgactat gtcctcactg gtggagaatt ggcagctatg 420 
accatgattg atgctacagt tcgcctgatt ccagaagtga ttggcaagga gtctagccac 4 80 
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caagatgata gtttttcttc aggtctttta gaatatcatc agtacacacg tccctatgat 540 

tatcgaggca tggtcgtgcc agatgtattg "atgagtggcc atca'tgaaaa gattcgtcag 600 

tggcgattgt acgagagttt aaagaaaacc tacgagcgca gaccggattt acttgaacat 660 

tatcaactga cagtagaaga agaaaaaatg ctggcagaaa ( tcaaagaaaa , caaagaat^a . 7,20 

<210> 23 1 ' 

<211> 561 ' . • ' 

<212> DNA 

<213> Streptococcus pneumoniae , 

<400> 23 . 1 , ' ■'' 

atgattgaag caagtaaatt aaaagctggt atgacctttg aaacagctga cggcaaattg 60 ' 
attcgcgttt tggaagctag tcaccacaaja ccaggtaaag gaaacacgat catgcgtatg '120 
aaattgcgtg atgtccgtac tggttctaca tttgacacaa gctaccgtcc agaggaaaaa 180 
tttgaacaag ctattatcga gactgtccca gctcaatact tgtacaaaat ggatgacaca 240 
gcatacttca tgaatacaga aacttacgac cagtacgaaa tccctgtagt caatgttgaa 300 
aacgaattgc tttacatcct tgaaaactct gatgtgaaaa tccaattcta cggaactg^a,- 360 
gtgatcggtg tcaccgttcc tactactgtfc gagttgacag ttgctgaaac tcaaccatct 420 
atcaaaggtg ctactgttac aggttctggt aaaccagcaa cgatggaaac , tggacttgtc 480 
gtaaacgttc cagacttcat cgaagcagga caaaaactcg ttatcaacac tgcagaagga 540 
acttacgttt ctcgtgccta a . , 561 

<:210> 24 ' 1 . 

<211> 1572 
<212> DNA 

<213> Streptococcus pneumoniae ! ' 
<400> 24 

atggcatttg aaagtttaac agaacgtttg cagaacgtct ttaaaaatct acgtaaaaaa 60 
ggaaaaatct ctgaatctga tgtccaagag gcaaccaaag aaattcgctt ggccttgct'c 120 
gaggccgacg ttgccttgcc tgttgtaaag gactttatca agaaagttcg tgagcgtgca 180 
gtcgggcatg aggtcattga tacacttaat cctgcgcaac agattattaa aatcgttgat 240 
gaggaactga cagccg.tttt aggttctgat acggcagaaa ttatcaagtc acctaagatt 300 
ccaaccatca tcatgatggt tggtttacaa g.gggctggta aaacaacctt tgctggtaaa 360 
ttggccaaca aactcaagaa agaagaaaat gctcgtcctt tgatgattgc ggcggatatt 420 
tatcgtccag ctgccattga ccagcttaag accttgggac aacagattga tgtgcctgtc 480 
tttgcacttg gaacagaagt accagctgtt gagattgtac gtcaaggttt ggagcaagcc 540 
caaactaatc ataacgacta tgtcttgatt gatactgcgg gtcgtttgca gattgatgag 600 
ctcctcatga atgagcttcg tgatgtgaaa .acattggctc .aaccaaatga aatcttgctt 660 
gtcgttgatg ctatgattgg tcaggaagca gccaatgttg cgcgtgagtt taatgctcag 720 
ttggaagtga ctggggtcat ccttaccaag attgatggcg atactcgtgg tggtgctgct 780 
ctgtctgttc gtcacattac tggaaaacca atcaagttca ctggtacagg tgaaaagatt 840 
acggacattg aaaccttcca cccagaccgc atgtctagcc gtatccttgg tatgggggat 900 
.atgctcactt tgattgagaa a'gcttctcag gaatacgatg aacaaaaagc ccttgaaatg 960 
gctgagaaga tgcgcgaaaa cacctttgat tttaatgatt tcatcgatca attagatcag 1020 
gtgcakaata tggggccgat ggaagacttg ctcaagatga ttccaggtat ggcaaacaat 1080 
ccagcccttc aaaacatgaa ggtggatgaa cgccagattg ctcgtaaacg tgccattgtg 1140 
tcttcgatga cacctgaaga gcgtgaaaac ccagatttgt taaatccaag ccgtcgccgt 1200 
cgtattgctg ctggttctgg aaatacattc gtcgaagtca ataaattcat caaggacttt 1260 
aaccaggcta aacagctcat gcagggtgtt atgtctgggg atatgaataa aatgatgaag 1320- 
caaatgggga ttaatccaaa taaccttcct aaaaatatgc caaatatggg aggaatggat 1380 
atgtctgccc ttgaaggaat gatgggacaa ggcggtatgc ctgacttatc agctctcgga 1440 
ggagcaggaa tgccagatat gagccagatg tttggtggcg gtttgaaagg taaaattggt 1500 
gaatttgcta tgaaacagtc catgaaacgt atggctaaca aaatgaagaa agcgaagaag 1560 
aaacgcaagt aa 1572 



<210> 25 
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<211> 846 

<212> DNA ■ • ' 

<213> Streptococcus pneumoniae 

5 <4 00> 25 1 ' 1 i 

atgtatctta ttgaaatttt aaaatctatc ttcttcggga ttgttgaagg aattaqggaa > 60 
tggttgccga tttccagtac aggtcatttg attttagcag' aggagtttat ccaataccaci 120 
aatcaaaatg aagcctttat gtccatgttt aatgtcgtga ttcagcttgg tgctatttta 180 
gcagttatgg tgatttattt . taacaagctc aatcctttta aaccgactaa ggacaaacag 240 

10 gaagttcgta agacttggag actatggttg aaggtcttga-' ttgctacttt acctttactt 300 
ggtgtcttta aatttgatga ttggtttgat acccacttcc at&acatggt tt,cagttgct '360 
ctcatgttga ttatctacgg ggttgccttc atctatttgg aaaagcgcaa taaagcgcgt 420 
gctatcgagc caagtgtaac agagttggac 'aagcttcctt atacgaccgc tttctatatc 480 
ggactcttcc aagttcttgo tcttttacca gggactagcc gttcaggtgc aacgattgtc 540 

15 ggtggtttgt taaatggaac cagtcgttca gttgtgacag aatttacctt ctatcttggg 600 
attcccgtta tgtttggagc tagtgcctta aagattttca aatttgtgaa agccggagaa 660 
. ctcttgagct ttgggcaatt gtttttgctc ttggtcgcga ( tgggagtagc ttttgcggtc 720 
agcatggtgg ctattcgctt cttgaccagc tatgtgaaaa aacacgactt cacccttttt 780 
ggtaaatacc gtatcgtgct tggtagtgtt ttgctacttt ' acagttt t tgt ccgtttattt 840 

20 gtataa ■ ■ 84 6 

, I 

<210> 26 ' 1 ' 
<211>1290 

<212> DNA 1 • ' 

25 <213> Streptococcus pneumoniae 

i ' 

<400> 26 

atgggattat ttgaccgtct attcggaaaa aaagaagaac ctaaaatcga agaagttgta 60 

aaagaagctc tggaaaatct tgatttgtct- gaagatattg agcctgcctt cacagaagct 120 

30 gaggaagttt ctcaagaaga agcagaggtt gaaagttctg aagaatctgt gttccaagaa 180 

gaggatagtc aagacacagt cgaagaaaat ctggatttag agccagttgt agaggtttct 240 

caagaagaag tagaagaatt tccaaactca caagaagtca cagaggaaga gaagcttgag 300 

cacgaaggaa ctgtagaaga aaataatttt gaagtgcttg aaccagaagc tcctcaaaca 360 

gaagaaactg ttcaggaaaa atatgaccgc agtcttaaga aaactcgcac aggtttcggt 420 

35 gcccgcttga atgccttctt tgctaacttc cgctctgttg acgaagaatt tttcgaggaa 4 80 

ctggaagaac tgttgattat gagtgatgtt ggtgtccaag tcgcttctaa cttaacggag 540 

gaactacgtt acgaagccaa gcttgaaaat gccaagaaac ctgatgcact tcgtcgtgtc 600 

atcattgaga aattggttga gctttatgaa aaggatggta gctacgatga aagcat.ccac 660 

ttccaagata acttgacagt tatgctcttt gttggtgtga atggtgttgg gaaaacaact 720 

40 tctatcggaa aactag.ccca ccgctacaaa cgagctggta agaaggtcat gctggttgca 780 

gcagatacct tccgtgcggg tgcagtagct cagctagctg aatggggccg acgagtagat 840 

gttccagtag taactggacc tgaaaaagct gatccagcca gcgtggtctt tgatggtatg 900 

gaacgtgccg tggctgaagg tatcgatatt ctcatgattg atactgctgg tcgtctgcaa 960 

aataaggata accttatggc tgagttggaa aagattggtc gtattatcaa acgtgttgtg 1020 

45 ccagaagcac cacatgaaac, cttcttggca cttgatgcat caacaggtca aaatgcccta 1080 

gtacaggcca aagaattttc gaaaatcaca cctttaacgg gaattgtttt gactaagatt 1140 

gatggaactg ctcgaggagg tgtggttcta gccattcgtg aagaactcaa tattcctgta 1200 

aaattgattg gttttggtga aaaaatcgat gatattggag agtttaactc agaaaacttc 12.60 

atgaaaggtc tcttggaagg tttaatctaa 1290 

50 

<210> 27 
<211> 498 
<212> DNA 

<213> Streptococcus pneumoniae 

55 

<400> 27 

atgtatattg aaatggtaga tgaaactggt caagtttcaa aagaaatgtt gcaacaaacc 60 
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1 

caagaaattt tggaatttgc agccaaaaaa ttaggaaaag aagacaagga gatggcagtc 120 

acttttgtga ccaatgagcg tagtcatgaa cttaatctgg agtaccgtga caccgaccgt 180 

ccgacagatg tcatcagcct tgagtataaa ccagaattgg aaattgcctt tgacgaagag 240 

gatttgcttg aaaatccaga attggcagag atgatgtctg, agtttgatgc, ctatattggg ,300 

5 1 gaattgttca tctctatcga taaggctcat gagcaggccg. aagaata^tgg tcacagcttt ,36(J 

gagcgtgaga tgggcttctt ggcagtacac ggctttttac i atattaacgg ctatgatcac 420 

tatactccgg aagaagaagc ggagatgttc ggtttacaag 'aagaaatttt gacagcctckt 480 
ggactcacaa gacaataa 498 

10 <210> 28 ' 

<211> 768 1 ' i 1 ' 

<212> DNA 

<213> Streptococcus pneumoniae 

* 

15 <400> 28 

atgagtattc gagtaattat tgccggtttjt aagggaaaga tgggccaggc tgcttgtcag 60 

atggtattga ctgatccaga cttggacttg gtggcagttt tggatccttt tgagtctgag 120 

tcagaatggc agggtattcc tgttttcaa!g gataaggctg atttagctgg ttttgaagcg 180 

gatgtctggg tagattttac tactccagct gttgcctacg aaaatacacg, ttttgctctt 240 

20 gaaaatggct ttgctccagt agttggaacg actggtttca cgagtgaaga aattgcagag 300 

ctaaaagaat tttctcgtgc ccaagacttg, ggtggcctga ttgcccctaa ctttgccttg 360 

ggtgctgtct tactcatgca atttgcgacg caggctgcca aatatttccc aaatgtggag 420 

iattattgagc tccatcatga caaga'aaaag gatgctccga gtggaacagc cattaaaaca 480 

gctgagttga tggcagaggt tcgagagtca attcagcaag gtgcagcaga tgaggaagag 540 

25 ctgattgctg gtgctcgtgg tgctgacttt '^atggtatgc gcatccactc agttcgtttg 600 

ccaggcttgg tagctcatca ggaagtcatc tttggcaatc agggagaagg gttgaccctc ^60 

cgtcatgact cctatgatcg catctccttc atgacaggag tcaatttggg aattaaagaa 720 
gttgtcaagc gtcatgagct tgtctatgga ttagaacact tattatga 768 

30 <210> 29 
<211> 276 
<212> DNA 

<213> Streptococcus pneumoniae 
35 <400> 29 

atggcaaaca aacaagattt gatcgctaaa gtagcagaag ctacagaatt aactaagaaa 60 
gactcagcag cagcagttga agctgtattt gcagcagtag ctgactatct tgcagctggt 120 
gaaaaagttc agttgatcgg ttttagtaac tttgaagttc gtgagcgcgc agaacgtaaa 180 
ggtcgcaacc cacaaactgg taaagaaatg acaattgcag cttctaaagt accagcattc 240 
40 aaagctggta aagctcttaa agacgctgtt aaataa 276 

<210> 30 
<211> 921 
<212> DNA 
45 <213> Streptococcus pneumoniae 

<400> 30 

atgactaaaa cagccttttt atttgctggt caaggtgccc agtatctagg gatgggacgg 60 

gatttctatg atcagtatcc gattgttaaa gaaacgattg atcgagcgag tcaggtgctc 120 

50 ggttatgatt tgcgttatct catcgatacg gaagaggaca aactcaatca gacccgctat 180 

acgcaaccag ccattctagc gacttcggtt gctatctacc gtttattgca agaaaagggc 240 

tatcagcctg atatggtcgc tggtttgtct cttggagaat actctgcctt ggtggcaagc 300 

ggcgccttgg attttgaaga tgcggttgcc ttggtagcta agcgtggagc ctatatggaa 360 

gaagcggctc . ctgctgactc tggcaagatg gtagcagttc tcaatacgcc agtagaggtc 420 

55 attgaagaag cctgtcaaaa agcttctgaa cttggagtgg ttactccagc c&actataac 480 

acacctgcac aaatcgtcat tgctggagaa gtggttgcag ttgatcgagc ggttgaactt 540 

ttgcaagaag caggtgccaa acgcttgatt cctcttaagg tgtcaggtcc ctttcacacc 600 
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gctctccttg. agccagctag ccagaaacta gctgaaactc tagctcaggt aagtttttca 660 

gattttactt gtcccctagt cggcaataca gaagctgctg tgatgpaaaa agaggacatt 720 

gctcagctct tgacgcgtca ggtcaaggaa cccgttcgtt tctatgaaag tattggggtc 780 

, atgcaagaag caggcataag caactttatc gagattggac 1 cggggaaagt' cttgtcagijt " 840, 

5 tttgttaaaa aaattgatca aactgctcac ttagctcatg tggaagatca agcgagttta '900 

gtagcacttt tagaaaaata g , ' / ■ '921 

<210> 31 ' ' ■ 

<211> 732 , ; 
10 <212> DNA , ' , 

<213> Streptococcus pneumoniae 

' <400> 31 

atgaaactag aacataaaaa tatctttatit acaggttcga gtcgtggaat tggtcttgcc 60 
15 atcgcccaca agtttgctca agcaggagcc aacattgtct taaacagtcg tggggcaatc 120 
tcagaagaat tgctcgctga gttttcaaac tatggtatca aggtggttcc . catttcagga 180 
gatgtatcag attttgcaga cgctaagcgt atgattgatc aagctattgc agaactgg^t 1 240 
tcagtagatg ttttggtcaa caatgcaggg attacccaag atactcttat gctcaagatg 300 
acagaagcag attttgaaaa agtgctcaag gtcaatctga ctggtgcctt 1 taatatgaca 360 
20 caatcagtct tgaaaccgat gatgaaagcc agagaaggtg ctatcattaa tatgtctagt 420 
gttgttggtt tgatggggaa tattggtcaa* gctaactatg ctgcttctaa ggctggcttg 480 
attggcttta ccaagtctgt ggcacgcgag gtcgctagtc ggaatatacg agtcaatgtg 540 
attgctccag gaatgattga gtctgatatg accgctatct tatcagataa gattaaggaa 600 
'gctacactag ctcagattcc gatgaaaga-a jtttgggcagg cagagcaggt tgcagatttg 660 
25 acagtatttt tagcaggcca agattatct'a actggtcaag* tggttgccat tgatggtggc 720 
ttaagtatgt ag 732 

<210> 32 
<211>*831 
30 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 32 

atgggagtga aaaagaaact aaagttgact agtttgctag gactgtctct gttaatcatg 60 

35 acagcctgtg cgactaatgg ggtaactagc gatattacag ccgaatcggc tgatttttgg 120 

agtaaattgg tttacttctt tgcggaaatc attcgctttt tatcgtttga tattagtatc 180 

ggagtgggga ttattctctt tacggtcttg, attcgtacag tcctcttgcc agtctttcag 240 

gtgcaaatgg tggcttctag gaaaatgcag gaagctcagc cacgcattaa ggcgcttcga 300 

gaacaatatc caggtcgaga tatggaaagc agaaccaaac tagagcagga aatgcgtaaa 360 

40 gtatttaaag aaatgggtgt cagacagtca gactctcttt ggccgatttt gattcagatg 420 

ccggttattt tggccctgtt ccaagcccta tcaagagttg actttttaaa gacaggtcat 480 

ttcttatgga ttaaccttgg -^agtgtggat acaacccttg ttcttccgat tttagcagca 540 

gtattcacct ttttaagtac ttggttgtcc aacaaagctt tgtctgagcg aaatggcgct 600 

acgactgcga tgatgtatgg gattccagtc ttgattttta tctttgcagt ttatgcgcca 660 

45 ggtggagtcg ccctatactg gacagtgtct aatgcttatc- aagtcttgca aacctatttc 720 

ttgaataatc cattcaagat tatcgcagag cgcgaggccg tagtacaggc acaaaaagat 780 

ttggaaaata gaaaaagaaa agccaagaaa aaggctcaga aaacgaaata a 831 

<210> 33 
50 <211> 1230 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 33 

55 atgaagatta gtaagaggca cttattaaat tattccatct tgattcccta cttactttta 60 
tctattttgg gcttgattgt ggtctattcg accaccagtg ctattttaat tgaagaaggc 120 
aagagcgcct tgcagttggt tcgaaaccaa ggaatctttt ggattgttag tttgatactg 180 
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attgccttaa tttataaatt gagactagat tttttgagaa atgagcgact aatcatttta' 240 
gttatattaa tagaaatgct tttattgttc ttggctcgtt ttattggtat ttcagtaaac 300 
ggggcatacg gttggatttc ggttgcagga gtaactattc agccagctga gtacttaaaa 360 
atcattatta tttggtatt,t agctcaccga ttctccaaac agcaiagaaga aatagctact 420 
tatgattttc aagttttgac tcaaaatcaa tggcttcccc gtgcttttaa 1 tgattggqga 480 
ttcgttctcc tagttctgat tggaagtttg ggaattttcc ctgatttagg aaatgcgact 540 
attttagtct tggtttcctt gattatgtat acagttagtg' gaatcgctta tcgctggttt 600 
tcaaccattc tggcgctcgt atctgccact tctgtctttg tcttgaccac tatcagccta 660 
atcggtgttg agaccttttc aaaaattcca gtatttggct atgtagccaa gcgctttagt 720 
gcctttttta atccttttgc cgatcgtgct gatcfcaggtc accagttagc taattcttat 780 
tttgccatgg tcaatggtgg ttggtttggt ctaggtcttg: gaaactcgat t'gaaaaacga 84 0 
ggttatttgc cagaagctca tacagacttt gtcttttcta tcgtgattga agaatttggc 900 
tttgttggtg ccagtcttat tttagctctc '.ttgtttttca tgattttgcg gattatcttg 960 
gtcggtatcc 'gagcggagaa tcctttcaat gccatggttg cactcggtgt cggagggatg 1020 
atgttggttc aggtatttgt caatatcgga gggatttcgg gcttgattcc atctacagga 1080. 
gtgaetttcc ctttcttatc ccagggtgga aatagtcttc tagtcttatc agtggcagta 1140 
gcctttgtct taaatattga tgccagtgaa aaacgcgcta agttgtaccg agaattggaa 1200 

aatcaaccaa tgaaccttct gttgaagtag 1230 

t 1 ' i 

<210> 34 , . ( 

<211> 1260 
<212> DNA 1 

<213> Streptococcus pneumoniae 

( * ■ 

<400> 34 

atgctcggaa ttttaacctt tattctggtt tttgggatta ttgtagtggt gcacgagttc 60 
gggcacttct actttgccaa gaaatcaggg attttagtac gtgaatttgc catcggtatg 120 ! 
ggacctaaaa tctttgctca cattggcaag gatggaacgg* cctataccat tcgaatcttg 180 
cctctgggtg gctatgtccg catggccggt tggggtgatg atacaactga aatcaagaca 24 0 
ggaacgcctg ttagtttgac acttgctgat gatggtaagg ttaaacgcat caatctctca 300 
gjtaaaaaat tggatcaaac agccctccct atgcaggtga cccagtttga ttttgaagac 360 
aagctcttta tcaaaggatt ggttctggaa gaagaaaaaa catttgcagt ggatcacgat 420 
gcaacggttg tggaagcaga tggtactgag gttcggattg cacctttaga tgttcaatat 480 
caaaatgcga ctatctgggg caaactgatt accaattttg caggtcctat gaacaatttt 540 
atcttaggtg tcgttgtttt ttgggtttta' atctttatgc agggtggtgt cagagatgtt 600 
gataccaatc agttccatat catgccccaa ggtgccttgg ccaaggtagg agtaccagaa 660 
acggcacaaa ttaccaagat cggctcacat gaggttagca actgggaaag cttgatccaa 720 
gctgtggaaa cagaaaccaa agataagacg gcaccgactt tggatgtgac tatttctgaa 780 
aaggggagtg acaaacaagt cactgttaca cccgaagata gtcaaggtcg ttaccttcta 840 
ggtgttcaac cgggggttaa gtcagatttt ctatccatgt ttgtaggtgg ttttacaact 900 
gctgctgact cagctctccg aattctctca gctctgaaaa atctgatttt ccaaccggat 960 
ttgaacaagt tgggtggacc tgttgctatc tttaaggcaa gtagtgatgc tgctaaaaat 1020 
ggaattgaga atatcttgta cttcttggca atgatttcca tcaatattgg gatttttaat 1080 
cttattccga ttccagcctt ggatggtggt aagattgtgc tcaatatcct agaagccatc 1140 
cgccgcaaac cattgaaaca agaaattgaa acctatgtca ccttggccgg agtggtcatc 1200 
atggttgtct tgatgattgc tgtgacttgg aatgacatta tgcgactctt ttttagataa 1260 

<210> 35 
<211> 594 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 35 

atgtacgcat atttaaaagg aatcattacc aaaattactg ccaaatacat tgttcttgaa 60 
accaatggta ttggttatat cctgcatgtg gccaatcctt atgcctattc aggtcaggtt 120 
aatcaggagg ctcagattta tgtgcatcag gttgtgcgtg aggacgccca tttgctttat 180 
ggatttcgct cagaggatga gaaaaagctc tttcttagtc taatttcggt ctctgggatt 240 
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15 



20 



25 



35 



40 



45 



50 



55 



ggtcctgtat „cagct:cttgc tattatcgct gejtgatgaca atgctggctt gg.ttcaagcc 300 
attgaaacca agaacatcac ctacttgacc aagttcccta aaattggcaa gaaaacagcc 3€0. 
cagcagatgg tgctggactt ggaaggcaag gtag;tagttg cagpagatga ' ccttcctgcc 420 
aaggtcgcag tgcaagcaag tgctgaaaac caacjaattgg aagaagctatf ggaagccatg * 480 
ttggctctgg gctacaaggc aacagagctc aagaaaatca agaaattctt tgaaggaacg 540 
acagatacag ctgagaacta tatcaagtcg gcccttaaaa ' tgttggtcaa atag' • ' 594 



<210> 36 
<211> 774 
10 <212> DNA . , ' 

<213> Streptococcus pneumoniae 



<400> 36 • ' i ! 

atgaagaata atcgtatttt agcactttct ggaaatgata tttttagtgg tggtggactg 60 
tcagctgatt tggctaccta taccttgaac ggcttgcatg 1 ggtttgtagc agtgacttgt 120 
ttgacagcct tgacagaaaa aggatttgaa gtctttccaa ctgatgatac catttttcaa 100 
catgaattag atagcttgcg tgatgtggaa tttgggggaa ttaagattgg tcttctccct 240 
actgtcagtg tggctgagaa ggccttggac tttatcaaac, aacgcccagg agtacctgtg 300 
gtgttggatc ctgtcttggt ctgcaaggaa acgcatgatg tag.ctgtcag tgagctctgc 360 
caagagttga ttcgcttctt ! cccttatgtc agtgtgatta cgcctaatct cccagaagca 420 
gaattattat ccggtcagga aattaaaacc ttggaagaca tgaaaacfrgc agcgcagaaa 480 
ttgcatgatt taggag<:gcc agcagtcatt atcaagggag gcaatcgtct tagtcaggac 540 
aagg.ctg.tgg atgtctttta tgatggacag acctttacta tcctagaaaa tccagttatc 600 
caaggccaaa atgctggtgc aggttgtacc tttgcctcta gcattgccag tcacctggtt 660 
aaaggtgata aatttttgcc agcagtagaa agctctaagg ctttcgttta tcgtgctatt 720 
gcacaagcag atcagtatgg agtaagacaa tatgaagcaa acaaaaacaa ctaa 774 



<210> 37 
<211> 1239 
30 <212> DNA 

<213> Streptococcus pneumoniae 



<400> 37 

atgattgaaa cggagaaaaa agaggagcga gtcctgctga ttggtgtgga attgcagggt 60 
atggacagtt ttgacctctc catggaagaa ttggctagtt tagcgaaaac ggcaggggca 120 
gtcgttgtag atagctacag acaaaaacgt gaaaaatatg attccaagac cttcgtcggc 180 
tctggtaagt tggaagagat tgcgcttatg gtggatgcag aagaaatcac tactgtcatc 240 
gtcaacaatc gtctgacccc aaggcagaat gtcaatctag aggaagttct cggtgttaag 300 
gtcattgacc gtatgcagtt gattttggat atctttgcca tgcgggctcg aagccatgaa 360. 
gggaagctcc aagtccacct agcccaattc aaatacctct tgcctcgctt ggttggtcag 420 
gggattatgc tcagccgtca ggcaggggga attggttccc gtggtcctgg tgaaagccaa 480 
ctggagctga accgtcgtag cgttcgcaat caaatcacgg atatcgagcg ccagcttaag 54 0 
gtggttgaga aaaatcgtgc gactgtcaga gaaaaacgtt tggagtctag cacttttaag 600 
attggtttga ttggttatac taatgctggg aaatcaacta tcatgaacat cttgaccagt 660 
aagacccagt atgaagcaga tgagctcttt gcgactctgg atgcgacaac caagagtatt 720 
catctgggag gcaatctcca agtaactttg acagataccg ttggctttat ccaagatttg 780 
ccgacagagt tggtgtccag tttcaagtca accttggaag aaagcaagca tgtggacctt 840 
ctggttcatg ttatcgatgc tagcaatcct taccacgagg* agcatgaaaa aacggttctc 900 
tccatcatga aagacctgga catggaagat attcctcact tgacgcttta taataaagcg 960 
gatttggtgg aggatttcac gcctacccaa- acgccatata ccctcatttc tgccaagtct 1020 
gaggacagtc gtgaaaactt gcaagcatta ttgctagata agattaagga aatttttgaa 1080 
gcatttaccc tgcgagtgcc tttttcaaag tcctacaaga ttcatgattt agagagtgtt 1140 
gcaattctgg aagaacgtga ttatcaggaa gacggcgaag tgattacagg .ctacatttcc 1200 
gagaaaaata aatggaggtt agaagaattt tatgactga 1239 

<210> 38 
<211> 483 
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<212> DNA 

<213> Streptococcus pneumoniae 

i , 

<40 ?> 38 , .. • , , , : . ... 

atggcagaaa aaacatatcc tatgaccctt gaggaaaagg, aaaaacttga aaaagaatta ,60 1 
gaagaattga aattggttcg tcgaccagaa gtggtagaac igcattaagat tgcccgttca ,120 
tacggtgacc tttcagaaaa cagtgagtac gaagcagcta' aggatcfaaca agcctttgtic 180 
gaaggacaaa tctctagctt agaaacaaaa atccgctatg ctgaaatcgt caatagcgac 24 0 
gcagttgccc aggacgaagt agcgattggt aaaacagtca ccatccaaga aattggtgag 300 
gacgaagaag aa'gtttatat tatcgtaggt tcagctggtg -cggatgcctt tgcaggtaag 360 
gtttcaaatg aaagcccaat tgggcaggcc ttgattggqa agaa'aacagg tgatacagca 420 
acc.attgaaa cgcctgttgg tagctatgat gtaaaaatct tgaaggttga aaaaacagcc 4 80 



1 taa 

15 <210> 39 
<2'll> 570 
<212> DNA 

<213> .Streptococcus pneumoniae 



4'83 



I I.- 



20 <400> 39 

atg'accaaat tacttgtagg cttgggaaat . ccaggggata aatattttga aacaaaacac 60 
aatgttggtt ttatgttgat tgatcaact;a gcg.aagaaac agaatgtcac ttttacacac 120 
gataagatat ttcaagctga cctagcatcc tttttcctaa atggagaaaa aatttatctg 180 
gttaaaccaa cgacctttat ' gaatgaaagjt ggaaaagcag ttcatgcttt attaacttac 240 

25 tatggtttgg atattgacga tttacttatc atttacgatg atcttgacat ggaagttggg 300 
aaaattcgtt taagagcaaa aggctcagca ggtggtcata atggtatcaa gtctattatt $60 
caacatatag gaactcaggt ctttaaccgt gttaagattg gaattggaag acctaaaaat 420 
ggtatgtcag ttgttcatca tgttttgagt aagtttgaca gggatgagta tatcggtatt 480 
ttacagtctg ttgacaaagt tgacgattct gtaaactact atttacaaga gaaaaatttt 540 

30 gagaaaacaa tgcagaggta taacggataa 570 

<210> 40 
<211> 852 
<212> DNA 
35 <213> Streptococcus pneumoniae 

<400> 40 

atgattttaa ttacaggggc aaatggccaa ttaggaacgg aacttcgcta tttattggat 60 
gaacgtaatg aagaatacgt ggcagtagat gtggctaaga tggacattac caatgaagaa 120 
atggttgaga aagtttttga agaggtgaaa ccgactttag tctaccattg tgcagcctac 180 
accgctgttg atgcagcaga ggatgaagga aaagagttgg acttcgccat caatgtgacg 240 
gggacaaaaa atgtcgcaaa agcatctgaa aagcatggtg caactctagt ttatatttct 300 
acggactatg tctttgacgg taagaaacca gttggacaag agtgggaagt tgatgaccga 360 
ccagatccac agacagaata tggacgcact aagcgtatgg gggaagagtt agttgagaag 420 
catgtgtcta atttctatat tatccgtact gcctgggtat ttggaaatta tggcaaaaac 480 
ttcgttttta ccatgcaaaa tcttgcgaaa actcataaga ctttaacagt tgtaaatgat 540 
cagtacggtc gtccgacttg gactcgtacc ttggctgagt tcatgaccta cctagctgaa 600 
aatcgtaagg aatttggtta ttatcattt'g tcaaatgatg cgacagaaga cacaacatgg 660 
tatgattttg cagttgaaat tttgaaagat acagatgtcg aagtcaagcc agtagattcc 720 
agtcaatttc cagccaaagc taaacgtccg ctaaactcaa cgatgagcct ggccaaagcc 780 
aaagctactg gatttgttat tccaacttgg caagatgcat tgcaagaatt ttacaaacaa 840 
gaagtgagat aa * 852 



<210> 41 
55 <211> 1224 
<212> DNA 

<213> Streptococcus pneumoniae 
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<400> 41 

atgaaacgtt ctctcgactc aagagtcgat tacagt.ttgc tcttgccagt attttttcta 60 

, ctggtcatcg gtgtggtggc tatctatata gccgtt'agtc' atgat'tacbc caataatatt i2Q 

5 ctgcccattt tagggcagca ggtcgcctgg attgccttgg ggcttgtgat tggttttgtg ' 180 

gtcatgctct ttaatacaga atttct.ttgg aaggtgaccc ' cctttctata tattttaggc ' 240 

ttgggactta tgatcttgcc . gattigtattt tataatccaa gcttagttgc atcaacgggt 300 

gccaaaaact gggtatcaat aaatggaatt accctattcc aaccgtcaga atttatgaag 360 

atatcctata tcctcatgtt ggctcgtgtc attgtccaat ttacaaagaa acataaggaa 420 

10 tggagacgca cggttccgct ggactttttg ttaattttct ' gcjatgat'tct ctttaccatt 480 

ccagtcctag ttcttttagc acttcaaagt gacttgggga cggetttggt ttttgtagcc 540 
* attttctcag gaatcgtttt attatcag<jg gtttcttgga aaattattat , cccagtattt ( 600 

gtgactgctg taacaggagt tgctggtttjc ttagctatct ttattagcaa ggacg'gacga 660 

gcttttcttc accagattgg aatgccgacc taccaaatca atcggatttt ggcttggctc '720 

15 aatccctttg agtttgccca aacaacgact taccagcagg ctcaagggca gattgccatt 780 

gggagtggtg gcttatttgg tcagggattt aatgcttcga atctgcttat cccagt.tcaa 840 

gagtcagata tgatttttac ggttattgca gaagattttg gctttattgg ctctgtcctcf 500 

gttattgccc tctatctcat gttgatttac cgtatgttga agattactct taaatcaaat 960 
aaccagttct acacttatat ttccacaggt ttgattatga tgttgctctt, ccacatcttt 1020 

20 gagaatatcg gtgctgtgac tggactactt cctttgacgg ggattccctt gcctttcatt 1080 

tcgcaagggg gatcagctat tatcagtaat' ctgattggtg ttggtttgct tttatcgatg 1140 

agttaccaga ctaatctagc tgaagaaaag agcggaaaag ttccattcaa acggaaaaag 1200 
gttgtattaa aacaaattaa ataa j 1224 

25 <210> 42 ' 

<211> 609 * ! ' . 

<212> DNA 

<213> Streptococcus pneumoniae 
30 <400> 42 

atgggaaaaa tcatcggaat cactggggga attgcctcag gtaagtcaac tgtgacaaat 60 

tttctaaaac accaagggct ttcaagcagt ggattgccga cgcagtgttc caccaactac 120 

agaaaacctg gtggtcgtct gtttgaggct ttagtacagc actttgggca agaaatcatt 180 

cttgaaaacg gagaactcaa tcgccctctc atagctagtc tcatcttttc aaatcctgaa 240 

35 gagcaaaaat ggtctaatca aattcaaggg gagattatcc gtgaggaact ggctactttg 300 

agagaacagt tggctcagac agaagagatt ttcttcatgg atattcccct actttttgaa 360 

caggactaca gcgattggtt tgctgagact tggttggtct atgtggaccg agatgcccaa 420 

gtagaacgct taatgaaaag ggaccagttg tccaaagatg aagctgagtc tcgtatggca 480 

gcccagtggc cttta.gaaaa aaagaaagat ttggccagcc aggttcttga taataatggc 540 ' 

40 aatcagaacc agcttcttaa tcaagtgcat atccttcttg agggaggtag gcaagatgac 600 

agagattaa 609 

<210> 43 

<211> 1260 

45 <212> DNA 

<213> Streptococcus pneumoniae 

. <400> 43 

atgagaaaaa ttgttatcaa tggtggatta ccactgcaag gtgaaattac tattagtggt 60 
50 gctaaaaata gtgttgtggc cttaattcca gctattatat tggctgatga tgtggtgact 120 
ttggattgtg ttccagatat ttcggatgta gccagtcttg tcgaaatcat ggaattgatg 180 
ggagctactg ttaagcgtta tgacgatgtc ttggagattg atccaagagg tgttcaaaat 240 
attccaatgc cttatggtaa aattaacagt cttcgtgcat cttactattt ttatgggagc 300 
ctcttaggcc gttttggtga agcgacagtt ggtctaccgg gaggatgtga tcttggtcct 360 
55 cgtccgattg acttacacct taaggcgttt gaagctatgg gtgccactgc tagctacgag 420 
ggagataaca tgaagttatc tgctaaagat acaggacttc atggtgcaag tatttacatg 480 
gatacggtta gtgtgggagc aacgattaat acgatgattg ctgcagttaa agcaaatggt 540 
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cgtactatta ttgaaaatgc agcccgtgaa cctgagatta ttgatgtagc tactctqttg. 600 
aataatatgg gtgcccatat ccgtggggca ggaactaata tcatcattat tgatggtgtt 660 
gaaagattac atgggacacg tcatcaggtg attccagacc gcattgaagc .tggaacatat 720 
atatctttag ctgctgcagt tggtaaagga attcgtataa ataatgttct ttacgaacac 780 
5 ctggaagggt ttattgctaia gttggaagaa a^gggagtga gaatgactgt atctgaagac' 84 0 
agcatttttg tcgacygaaca gtctaatttg aaagcaatca atattaagac agctccttac, 900 
ccaggctttg caactgattt gcaacaaccg cttaccccto tttt'actaag agcgaatggt 960 
cgtggtacaa ttgtcgatac gatttacgaa aaacgtgtaa atcatgtttt tgaactagca 1020 
aagatggatg cggatatttc gacaacaaat ggtcatattt tgtacacggg tggacgtgat 1080 
10 ttacgtgggg ccagtgttaa agpgaccgac ttaagagctg, gggctgcact agtcattgct 114 0 
gggcttatgg ctgaaggtaa aa'ctgaaatt acc'aatattg agtttatctt acgtggttat 1200 

tctgatatta tcgaaaaatt acgtaattta ggagcggata ttagacttgt tgaggattaa 1260 

* i ■ ' * * 1 

<210> 44 i ■ ' • ' 

15 <211> 696 ( ' 

<212> DNA 1 ' 

<213> Streptococcus pneumoniae , - 1 

( 

i i 

<400> 44 ... ! , \ 1 

20 atgtcaagaa ttgaattttc accatctttg atgaccatgg atttggacaa attcaaagag 60 
cagattactt ttttgaatga taaagtagca tcttatcata tcgatattat ggatggccat 120 
tttgttccca i afcattaccfct gtctccttgg ttcattcaag aagttca&aa aattagtgac 180 
acacctttat cagttcatct gatggtcaca gacccaacct tttgggtaga tcaagttctc 240 
gatttacaat gtgagtatat ttgtattcat gctgaagttc tgaatggtct tgcttttcgt 300 ' 

25 ttgattgata aaattcatga tgcaggtcta aaggctggtg ttgtccttaa tcctgaaaca 360 
cctgtttcta caatctttcc ctacattgat ttacttgaca aagtaactat tatgactgta 420 
gatccaggtt ttgcaggaca acgctttttg gagtctacct tgtataaaat ccaagaactc 480 t 
cgtcagctta gagttcagaa tggttatcac tacatcattg agatggatgg ttcttcgagt 54 0 
cgtaagactt tcaaacaaat tgatgtggca ggaccagata tttatgttat aggtcgcagt 600 

30 ggattatttg gtttggatga cgatattgcc aaagcctggg atatctgttc tagagattac 660 
gaagaaatga ccggaaaaac aatgccaatc aaataa 696 

<210> 45 
<2il> 1125 
35 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 45 

atgagaaata tggctttgac agcaggtatc gttggtttgc caaacgttgg taaatcaaca 60 

40 ctatttaatg caattacaaa agcaggagca gaggcagcaa actacccatt tgcgactatt 120 

gatccaaatg ttggaatggt ggaa'gatcca gatgaacgcc tacaaaaact aactgaaatg 180 
ataactccta aaaagacagt tcccacaaca tttgaattta cggatattgc agggattgta 240 

aaaggagctt caaaaggaga agggctaggg aataaattct tggccaatat tcgtgaagta 300 
gatgcgattg ttcacgtagt tcgtgctttt gatgatgaaa atgtgatgcg cgagcaagga 360 

45 cgtgaagacg cctttgtaga tccacttgca gatattgata caattaatct ggaattaatt 420 

cttgctgact tagaatcagt gaacaaacga tatgcgcgtg tagaaaagat ggcacgtacg 4 80 

caaaaagata aagaatcagt agcagaattc aatgttcttc aaaagattaa accagtccta 540 

gaagacggga aatcagctcg taccattgaa tttacagatg aggaacaaaa ggttgtcaaa 600 

ggtcttttcc ttttgacgac taaaccagtt ctttatgtag ctaatgtgga cgaggatgtg 660 

50 gtttcagaac ctgactctat cgactatgtc aaacaaattc gtgaatttgc agcgacagaa 720 

aatgctgaag tagtcgttat ttctgcgcgt gctgaggaag aaatttctga attggatgat 780 

gaagataaaa aagagtttct tgaagccatt ggtttgacag aatcaggtgt agataagttg 840 

acgcgtgcag cttaccactt gcttggattg ggaacttact tcacagctgg tgaaaaagaa 900 

gttcgcgctt ggactttcaa acgtggtatg aaggctcctc aagcagctgg tattatccac 960 

55 tcagactttg aaaaaggctt tattcgtgca gtaaccatgt catatgaaga tctagtgaaa 1020 

tacggatctg aaaaggccgt aaaagaagct ggacgcttgc gtgaagaagg aaaagaatat 1080 
atcgttcaag atggcgatat catggaattc cgctttaatg tctaa 1125 



18 



WO 01/49721 PCTYUS00/35604 



50 



55 



<2io> 46 

<211> 333 

, .<212> DNA • . ' , • ' ' '.' ['^ ' . 

5 <213> Streptococcus pneumoniae ' ' 

<400> 46 

atggaaatcg aaaaaaccaa tcgtatga.at gcgctctttg aattttatgc ggcgcttttg 60 

acagataagc aaatgaatta tatagagctt tactacgctg atgattacag tcttgctgag 120 

10 atagctgagg agtttggtgt tagtcgtcag gctgtctatg acaatatcaa gcgaacagaa 180 

aagattctgg aagattatga gatgaaattg cacatgtact cggactacat tgtccgtagt 24 0 

cagatttttg accaaatctt ggagcgctat cccaaggatg attttctgca. ggagcagata , 300 

gaaattttaa caagqattga taatagagaa taa 1 ' 333 

15 <210> 47 
<211> 672 

<212> DNA ' i 1 ' 

<213> Streptococcus pneumoniae ; 

20 <400> 47 

atgaccttag aatgggaaga atttctagat' ccttacattc aagctgttgg tgagttaaag 60 

attaaacttc gtggtattcg taagcaatat cgtaagcaaa ataagcattc tccaattgag 120 

tttgtgaccg gtcgagtcaa gccaattgag agcatcaaag aaaaaatggc tcgtcgtggc 180 

attacttatg cgaccttgga acacgatttg ,caggatattg ctggcttacg tgtgatggtt 240 

25 cagtttgtag atgacgtcaa ggaagtagtg gatattttgc acaagcgtca ggatatgcga 300 

atcatacagg agcgagatta cattactcat agaaaagcat caggctatcg ttcctatcat 360 

gtggtagtag aatatacggt . tgataccatc aatggagcta agactatttt ggcagaaatt 420 

caaattcgta ctttgcjccat gaatttctgg gcaacgatag aacattctct caactacaag 4 80 

taccaagggg atttcccaga tgagattaag aagcgactgg aaattacagc tagaatcgcc -540 

30 catcagttgg atgaagaaat gggtgaaatt cgtgatgata tccaagaagc ccaggcactt 600 

tttgatcctt tgagtagaaa attaaatgac ggtgtaggaa acagtgacga tacagatgaa 660 
gaatacaggt aa 672 

<210> 48 
35 <211> 588 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 48 

40 atggaactta atacacacaa tgctgaaatc ttgctcagtg cagctaataa gtcccactat 60 
ccgcaggatg aactgccaga gattgcccta gcagggcgtt caaatgttgg taaatccagc 120 
tttatcaaca ctatgttgaa ccgtaagaat ctcgctcgta catcaggaaa acctggtaaa 180 
* acccagctcc tgaacttttt taacattgat gacaagatgc gctttgtgga tgtgcctggt 24-0 
tatggctatg ctcgtgtttc taaaaaggaa cgtgaaaagt gggggtgcat gattgaggag 300 
45 tacttaacga ctcgggaaaa tctccgtgcg gttgtcagtc tagttgacct tcgtcatgac 360 
ccgtcagcag atgatgtgca gatgtacgaa tttctcaagt attatgagat tccagtcatc 420 
attgtggcga ccaaggcgga caagattcct cgtggtaaat ggaacaagca tgaatcagca 480 
atcaaaaaga aattaaactt tgacccaagt gacgatttca tcctcttttc atctgtcagc 540 
aaggcaggga tggatgaggc ttgggatgca atcttagaaa aattgtga 588 



<210> 49 
<211> 294 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 49 

atgaaaacaa gaaaaatccc tttgcgcaag tctgttgtgt ctaacgaagt gattgataag 60 
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i » 

cgtgatttgc tccgcattgt taagaacaag gaaggacaag tctttattga tcctacgggo 120 
aaggccaatg gccgcggcgc ttatatcaaa ctagacaatg cagaagccct agaggcgaaa 180 
aagaagaagg tctttaaccg cagctttagc atggaagtgg aagaaagctt .ttatgacgag 240 
ttgatcgctt atgtgga'tc^a caaagtgaaa agaa^agagt tgggacttga ataa ■ 294 



i ' 



<210> 50 t| . 

<211> 312 
<212> DNA 

<213> Streptococcus pneumoniae 



<4 00> 50 1 ' ' 1 ; , , ' 

atgttaaaac cctctattga taccttgctc gacaaggttc cttcaaaata ticactcgta 60 
atcttggaag caaaacgtgc ccacgaattg« gaagcaggtg ccccagcaac tcaaggtttc 120 
aagtdtgaaa J aatcaactct tcgcgcttta ' gaagaaatcg aatcaggaaa cgttacaatt 180 
15 cacccagatc cagaagcjaaa acgtgaagca gtgcgtcgcc , gtatcgaaga agaaaaacgc 240 
cgcaaagaag aagaagaaaa gaaaatcaaa gagcaaattg ctaaagaaaa agaagatggt 300 

gaaaaaattt aa i 312 

i 

• i t 

<210> 51 ' 1 

20 <211> 312 . , 

<212> DN£ ( 

<213> Streptococcus pneumoniae 1 

<400> 51 ■ ■ , , 

25 atgtcattaa catcaaaaca acgtgccttc ctcaacagcc aggcacacac cctcaaacct 60 

atcatccaaa tcgggaaaaa tggactcaac gaccaaatca aaaccagcgt ccgtcaagct 120 

cttgatgcgc gtgaattaat caaggttact ctcttacaaa acacagatga aaacatccac 180( 

gaagtagctg aaattttgga agaagaaatc ggtgtggata cagtccaaaa aataggacgc 240 

atcttgattt tgtttaaaca atctagcaag 1 aaagaaaatc gcaagatttc taagaaagtc 300 

30 aaagaaatct aa 312 

<210> 52 
<211> 528 
<212> DNA 
35 <213> Streptococcus pneumoniae 

<400> 52 

atggcgattg aaaattatat accagatttt gctgtggaag cagtctatga tctgacagtc 60 

ccaagcctgc aggcgcaggg aatcaaggct gttttggtcg atttggataa taccctcatt 120 

40 gcttggaaca accctgatgg aacgccagag atgaagcaat ggctacatga ccttcgggac 180 

gcgggtattg gcattatcgt agtgtcaaat aacaccaaaa aacgcgttca acgagcagtt 24 0 

gagaaatttg- ggattgatta cgtttactgg gccttgaagc ■ ccttcacatt tggtattgac 300 

cgtgctatga aggaattcca ctatgacaaa aaggaagtgg tcatggttgg tgaccagctc 360 

atgacagata tacgagcagc ccaccgtgca gggattcggt caattttagt caaacccttg 420 

45 gtccaacatg actcaatcaa aacgcagatt aaccgaactc gtgagcgtcg tgttatgaga 480 

aaaatcactg aaaagtacgg accgattaca tataaaaaag gaatttaa 528 

<210> 53 
<211> 1368 
50 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 53 

atgtttcgaa aaattttaat tgccaatcgt ggtgaaattg cggttcgtat tatccgtgcg 60 

55 gcacgtgaat tggggattgc gacggtagcg gtttattcaa ctgctgataa ggaagctctt 120 

catacgctgt tggcagatga agcagtttgt attggtcctg gcaaggcaac agagtcttat 180 

ctcaatatta atgcagttpt atcagctgca gtcttgactg aggcagaagc tattcaccct 240 
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ggttttggat ttctcagtga aaattccaaa tttgcgacca tgtgtgaaga aataggtatc 300 
aagtttatcg gtccatctgg tcatgttatg gatatgatgg gggataaaat caatgcgcgt 360. 
gctcagatga ttaaagcagg tgtgcctgtt ataccacjgtt cagatggaga agtgcataac 420 
tctgaagaag ctttgatltgt tgctgaaaaa attggctatc ctgttatgct caaggcttca 480 
5 ' gcaggtggag gtggtaaagg gattcgtaag gttgaaaaac cagatgacct cgtttctgcc 540 
tttgaaactg cctctagtga ggccaaggcc aattatggca • atggtgccat gtacatagaa 1 600 
cgggttatct atccagct ( cg gcacattgag gttcaaatcc taggtgatga gcatggacat 660 
gtgattcact tgggtgaacg ggattgttct cttcaaagga ataaccaaaa ggttttggaa 720 
gaaagtccct cgattgcaat. cggaaaaacg ctgcgtcatg aaat,aggtgc tgctgctgtt 780 
10 cgagcggcag agtttgttgg ctatgagaat gcaggaacca ttgaatttct tcttgatgaa 840 
gcaagtagca atttctattt ( catggagatg aatactcgtg ttcaggtacya aqatccagta 900 
acagagtttg tttcaggtgt tgatatcgtt aaggaacaga tttgcattgc ggcaggtcag 960 
cctttgtctg ttaagcaaga agatattgtc'.ctacgcggtc atgccatcga gtgtcgtatc i020 
aatgcagaaa acccagcctt taactttgct ccaagtccag gtaagattac taatctctat 1080 
15 ctgccaagtg gtgg'agttgg ct'tgcgcgtg gattcagcag 1 tttatccagg ttataccatt 1140 
ccgccttatt atgatagtat gattgccaaa atcatagtac .acggcgaaaa tcgttttgac 1200 
gccttgatga aaatgcaacg tgccctctat gaattagaaa ttgaaggagt gcagacc'aat 1260 
gcagatttcc agcttgacct catttcagat cgcaatgtca ttgctgggga ttatgatact 1320 
tgcttcttga tggaaacctt cttacctaaa tatcaagaaa aagaataa 1368 
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<210> 54 
<211> 234 1 
<212> DNA 

<213> Streptococcus pneumoniae' 



<400> 54 

atgatttaca aagtttttta tcaagaaaca aaagaacgta gcccacgccg tgaaacaaca 60 1 

cgcgcgcttt acctagacat cgataccagc ( tcagaacttg agggccgtat cactgctcgc 120 

caacttgtcg aagaaaatcg cccagagtac aatatcgaat atatcgaact cttgtctgac 180 

30 aaattgctcg attacgaaaa agaaactggc gccttcgaaa ttacggagtt ctaa 234 

<210> 55 

<211>. 1011 

<212> DNA 

35 <213> Streptococcus pneumoniae 

* 

<400> 55 

atgaaggata gatatatttt agcatttgag acatcctgtg atgagaccag tgtcgccgtc 60 
ttgaaaaacg acgatgagct cttgtccaat gtcattgcta gtcaaattga gagtcacaaa 120 
40 cgttttggtg gcgtagtgcc cgaagtagcc agtcgtcacc atgtcgaggt cattacagcc 180 
-tgtatcgagg aggcattggc agaagcaggg attaccgaag aggacgtgac agctgttgcg 240 
gttacctacg gaccaggctt ggtcggagcc ttgctagttg gtttgtcagc tgccaaggcc 300 
tttgcttggg ctcacggact tccactgatt cctgttaatc acatggctgg gcacctcatg 360 
gcagctcaga gtgtggagcc tttggagttt cccttgctag ccctcttggt cagcggcgga 420 
45 cacacagagt tggtctatgt ttctgaggct ggcgattaca agattgttgg ggagacacga 480 
gatgacgcgg ttggcgaggc ctatgataag gtcggccgtg tcatgggctt gacctatcct 540 
gcaggtcgtg agattgacga gctggctcat caggggcagg atatttatga tttcccccgt 600 
gccatgatta aggaagataa tctggagttt tcattctctg gtttgaagtc agcctttatc 660 
aatcttcatc acaatgccga gcaaaaggga gaaagcctgt ctacagaaga tttgtgtgct 720 
50 tccttccaag cagcagttat ggacattctc atggcaaaaa ccaagaaggc tttggaggaa 780 
tatcctgtta aaaccctatt tgtggcaggt ggtgtggcag ccaataaagg tctcagagaa 840 
cgcttagcag ccgaaatcac agatgtcaag gttatcatcc cccctctgcg actctgcgga 900 
gacaatgcag gtatgattgc ctatgccagc gtcagcgagt ggaacaaaga aaacttcgca 960 
ggctgggacc tcaatgccaa accaagtctt gcctttgata ccatggaata a 1011 



<210> 56 
<211> 1809 
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<212> DNA 

213> Streptococcus pneumoniae ■ '• 

f i 

<400> 56 , ( ( ; , ' 
atgtgtggaa ttgttggtgt tgttggaaac acaaatgcaia, ctigatat.ttt gattcaagcyg . 60 1 

cttgaaaagc ttgaataccg tggctatgat tctgcgggaa ftttttgtcct agatggtgct ,120 

gataaccatt tggtgaaggc ggttggtcgt attgcagaat tgtctgccaa gacagctggt 180 

gttgagggaa caactggtat cggacatact cgttgggcta ctcacggaaa accaactgag 240 
gacaatgctc acccacaccg ctctgagaca gaacgttttg tcttggtgca taatggggtg 300 

attgagaact atcttgaaat caaggaagaa taccttgcag -gtcaccactt caagggcjcag 360 

acagatactg aaatagccgt tcatttgatt ggaaaatttg cggaagaaga agggctctca 420 

gttcttgaag cctttaaaaa agctcttcat attatccgtg gttcatatgc ctttgccttg 480 

attgactctg aaaatccaga tgtcatctat gtagcgaaaa acaaiatctcc acttttgatt 540 

ggtcttgggg aaggctacaa tatggtctgc tcagatgcta tggctatgat tc^rtgaaacc 600 

aaccaataca tggaaattca tgaccaagag ttggtaatcg tcaaggctga tagcgtggaa 660 

gt'tcaagact atgatggtaa cagtcgtgaa cgtgctagct atactgcgga acttgacttg 720 
tcagatatcg gtaagggaac ttatccttac tacatgctta aggaaattga tgagcaac^a.* 780 

actgttatgc gfcaaactcat tcaagccta'c acggatgatg ctggtcaagt agtggttgct 840 

cctgctatca ttaaggctgt tcaagacgca gaccgcatct acatccttgc agctggaaca 900 

tcttaccatg caggatttgc ttctaagaaa atgttggaag aattgacaga tacaccagtt 960 

gaacttggaa tctcatctga gtggggctac . ggtatgccac ttctcagcaa gaaaccactc 1020 

ttcatcttta tcagccaatc tggtgaaaca gcggatagtc gtcaagtttt ggtcaaggct 1080 

aatgaaatgg gaattccaag cttaacagtg acaaatgttc caggttcaac cctctcacgt 1140 

gaagccaact ataccatgct ccttcacgca ggacctgaaa ttgcqgtggc atcaactaaa 1200 

gcctatacag cgcaaatcgc agcccttgcc ttccttgcaa aagcagtcgg agaagcaaat 1260 

ggtaatgcta aagcgcaagc ctttgacctg gttcatgaat tgtcaatcgt agctcagtct 1320 

attgaatcaa ctctttcaga gaaagaaacc attgaagcca aggttcgtga acttcttgaa 1380 

acaactcgta acgcctttta catcggacgt ggtcaagatt actacgtagc catggaagca 14 40 

agtctcaaac tcaaagagat ttcttatatc cagtgtgaag gttttgcggc aggagaactc 1500 

aagcacggaa ccattgcctt gattgaagaa ggaacgcctg tt.ttggctct cttgtcagat 1560 

ccagttcttg ccaaccatac tcgtggaaat atccaagagg tcgcagcccg tggtgccaaa 1620 

gtcctcacta tcgcagaaga gaatgtagcc aaagataccg acgatatcgt ccttacgacc 1680 

gtacatccat acctctcacc aatttcaatg gtcgtaccaa cgcaattagt cgcttacttt 1740 

gcaaccctcc accgtggcct cgatgtggac aaaccacgta accttgccaa gtcagtaacg 1800 

gtagaataa 1809 



<210> '57 
<211> 723 
<212> DNA 
40 <213> Streptococcus pneumoniae 

<400> 57 

atgatacgta tcgaaaatct cagtgtctcc tacaaagaaa cgttggcact taaggatatt 60 
tcactagtgc tccatggacc aacaattacc ggcatcattg gtccaaacgg cgctgggaaa 120 

45 tcaacactat taaaaggtat gctgggaatt atcccacatc aaggtcaggc atttctcgat 180 
gacaaggaag ttaaaaaatc cttacaccga attgcctatg tcgaacaaaa aatcaatatc 240 
gactacaact ttcccatcaa ggtcaaggaa tgcgtctcgt taggactatt tccctctatt 300 
cctctctttc gaagtttaaa ggctaaacat tggaagaaag tgcaagaggc ccttgaaatc 360 
gtcggcctag ctgactacgc tgaacgtcaa attagtcaac tgtctggagg tcaattccag 420 

50 cgggtcttga ttgceagatg tttggtgcag gaagccgact atatcctctt ggatgaaccc 480 
tttgctggga ttgactctgt cagtgaggaa atca'tcatga atacgctgag agatttgaaa 540 
aaagctggga agacggttct * catcgttcac cacgacctca gcaagattcc ccactacttc 600 
gatcaagtct tacttgtcaa tcgagaagtg attgcctttg gtccaacaaa agaaactttt 660 
accgaaacca atctaaaaga agcttacggt aatcaactct ttttcaatgg aggtgaccta 720 

55 tga 723 

<210> 58 
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<211> 2223 
<212> DNA 

i 

<213> Streptococcus pneumoniae 

<400> 58 • ' ' 1 , 

atgccgaaag aagtgaattt aacaggcgaa gaagttgtcg 1 ctttaaccaa agaatattta' 60 
acggaagagg atgttcattt. tgtccataag gccttggtct atgctgttga atgccacagt 120 
ggtcaatatc gcaaatcagg cgagccttat atcattcacc ctatpcaagt ggcaggtatt 180 
ttagctaagc taaagctgga tgctgtaapa gtagcttgtg gattcttgca tgatgtggtg 240 
gaagatacag atgcgacctt ggacgatttg gaaagagagt' ttggtcctga tgtgcgggtg 300 
attgttgacg gagttaccaa gcttggcaag gtcgagtaca aatcgatcga ggagcaatta 360 
gcggaaaatc atcgcaagat gctcatggcc atgtctgagg ■ acatccgcgt. tattttggtc, 420 
aaa'ctgtctg accgcttgca caatatgcgg accctgaaac atcttcgaaa 'agacaagcag 480 
gagcgtattt ccaaagaaac catggaaatc tatgccccac ttgcccatcg tttggggatt 540 
15 tccagtgtca aatgggaatt agaagacttg tctttccgtt at'c'tcaatcc aacggagttt 600 
tacaagatta cccatatgat gaaggaaaag cgcagggagc gtgaggcctt ggtggatgag 660 
' gtagtcacaa aattagagga gtatacgaca gaacgtcact tgaaagggaa gatttatggt' 720 
cgtcccaagc atatttactc aattttccgc aaaatgcagg acaagagaaa acggtttgag 780 
gaaatctatg atctgattgc tattcgttgt attttagata cccaaagtga tgtttatgcc 840 
20 atgcttggtt acgtgcatga attttggaaa ccgatgccag gtcgcttcaa agactatatt 900 
gccaaccgca aggccaatgg ttatcagtct' atccatacga ctgtttatgg accaaaaggg 960 
ccgattgaat tccagattcg aaccaaggaa atgcacgagg tggctgagta cggggttgcg 1020 
gctcactggg cttataagaa aggtataaag gggcaagtta acagcaagga atcagctatt 1080 
ggaatgaact ggatcaagga gatgatggag ^tccaagacc aggctgatga tgctaaggaa 1140 
25 tttgtggact ctgttaagga aaactatctlg gctgaggaga tttacgtttt taccccagat .1200 
ggagctgtcc .gttcccttcc caaagattca ggaccgattg attttgccta cgaaatccat 1260 
accaaggtcg gtgaaaaagc aactggtgcc aaggtcaatg gccgcatggt tccactgaca 1320 
accaagttaa agacagggga tcaggttgaa attatcgcca acccgaactc ctttggacct 1380 
agccgtgact ggctcaatat ggtcaagact agcaaggcgc gcaataagat tcgccagttc 1440 
30 tttaaaaacc aagataagga attgtctgtc aacaagggtc gtgagatgct gatggctcag 1500 
ttccaagaaa atggctatgt ggcaaataaa tttatggaca agcgccacat ggatcaagtt 1560 
ctgcaaaaga ccagttacaa gacagaagac tccctctttg cggccattgg ttttggggaa 1620 
atcggtgcga ttaccgtctt taaccgtctg actgaaaagg aacgccgtga ggaagagcgt 1680 
gccaaggcca aggctgaggc agaggagctt gtcaaaggtg gcgaggtcaa ggttgaaaat 1740 
35 aaagaaactc tcaaggtcaa gcatgagggg ggagtggtta ttgaaggtgc ttctggtctc 1800 
ctagtgcgga ttgctaagtg ttgtaacccc gtgcctggtg acgatattgt tggctacatt. 1860 
accaagggtc gtggtgtggc tattcaccgt gtggactgta tgaacctgcg tgcccaagaa 1920 
aactacgagc aacgtctcct tgatgtggaa tgggaagacc agtactctag ctcaaataag 1980 
gagtatctgg cccatatcga tatctacggt ctcaaccgta caggactgtt gaacgatgta 2040 
40 ctgcaagttc tttcaaatac aaccaagaat atttcaacgg tcaatgccca accaaccaag 2100 
gatatgaagt ttgctaatat ccatgtgtcc ttcggtattg ccaacctctc tacactgacc 2160 
acggttgtcg ataaaattaa gagtgtgcca gaagtttact ctgtcaaacg -gaccaacggc 2220 
tag 2223 

45 <210> 59 ■ 
<211> 1479 
<212> DNA 

<213> Streptococcus pneumoniae 
50 <400> 59 

atgtctaatt gggacactaa atttttgaaa aaaggtttta cctttgatga tgtattgctt 60 
attccagctg aaagtcatgt gttgcctaac gatgcagatt taacaactaa attggcagat 120 
aatttgactt taaatatccc aattattacc gctgccatgg acacagttac agagagtcaa 180 
atggccattg ctattgctcg tgcaggcggt ctcggagtta tccataaaaa catgtcaatt 24 0 
55 gctcaacaag cagacgaggt tcgtaaggta aaacgttctg aaaatggagt tattattgat 300 
ccgttcttct tgacgcctga acatacaatt gctgaagcag atgagcttat gggtcgttac 360 
cgcatcagtg gtgttccagt tgttgaaaca cttgaaaatc gtaaattggt tggtattttg 420 
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I ' - 

I i 

acaaaccgag atcttcgttt tatttcagat , tataatcaac caatttcaaa ccatatgact- 480 
agtgaaaatc ttgttactgc tcctgtgggt acggatcttg caacggctga gagtattctt 540 
caagagcatc gtattgaaaa acttccgttg gtcgatgaag aaggcagtct ttctggtttg 600 
atcactatca aagatattqa aaaagttatt gagtttccaa atgcggctaa agatgagttt 660 
ggtcgtcttc tagttgcagg tgcagtaggt gttacttcag a ( tacatttgk acgtgcaga^ 720 
gctctttttg aggcaggagc ggatgcgatt gttattgata .ctgcacatgg tcattctgca, 780 
ggtgtcttgc gtaaaattgc cgagattcgt gctcatttce cagatcggac tttgattgct 840 
ggaaatattg ctactgctga aggtgcacgt gccctttatg aagcgggtgt agacgttgtt 900 
aaggttggta. ttggaccagg ttctatctgt actactcgtg tgattgctgg tgttggtgtt 960 
ccgcaagtaa cagctatcta cgatgctgca gctgttgcgo gcgaatatgg taaaacgatt 1020 
attgctgacg gtgggatca'a gtattctgga gat'attgtaa aagcacttgc tgcaggtgga 1080 
aatgctgtta tgcttggatc 'tatgtttgct ggaactgatg aagctccagg cgaaactga'a 1140 
atcttccaag gacgtaaatt caagacttac i, cg^ggtatgg gatcaattgc tgctatgaag 1200 
aaaggttcaa 'gfcgaccgtta tttccaaggt 'tctgtcaatg aagcaaacaa gcttgttcca 1260 
15 gaaggaattg aaggtc^tgt tgcttataaa ggagcggcag ,ctgatattgt tttccaaatg 1320 
attggtggta ttcgctctgg tatgggttac tgtggtgcag ctaaccttaa agaactacac 1380 
gataatgctc aatttattga aatgtctggt gctggtttga aagaaagcca tcctcatgat 14 4 0 
gtgcaaatta ctaatgaggc accaaattat tctatgtaa ' 1479 

i 1 1 i 

20 <210> 60 * 1 
<211> 1947 

<212> DNA I * i 

<213> Streptococcus pneumoniae 

i- 

25 <400> 60 

atgacagaag aaatcaaaaa tctgcaggca caggattatg atgccagtca aattcaagtt 60 
ttagagggct tagaggctgt tcgtatgcgt ccagggatgt acattggatc aacctcaaaa 120. 
gaaggtcttc accatctagt ctgggaaatt gttgataact caattgacga ggccttggca 180 
ggatttgcca gccatattca agtttttatt gagccagatg attcgattac tgttgtggat 240 
gatgggcgtg gtatcccagt cgatattcag gaaaaaacag gccgtcctgc tgttgagacc 300 
gtctttacag tccttcacgc tggaggaaag ttcggcggtg gtggatacaa ggtttcaggt 360 
ggtcttcacg gggtggggtc gtcagtagtt aatgcccttt ccactcaatt agacgttcat 420 
gttcacaaaa atggtaagat tcattaccaa gaataccgtc gtggtcatgt tgtcgcagat 480 
cttgaaatag ttggagatac ggataaaaca* ggaacaactg ttcacttcac accggaccca 54 0 
aaaatcttca ctgaaacaac aatctttgat tttgataaat taaataaacg gattcaagag 600 
ttggcctttc taaatcgcgg tcttcaaatt tcaattacag ataagcgcca aggtttggaa 660 
caaaccaagc attatcatta tgaaggtggg attgctagtt acgttgaata tatcaacgag 720 
aacaaggatg taatctttga tacaccaatc tatacagacg gtgagatgga tgatatcaca 780 
gttgaggtag ccatgcagta cacaactggt taccatgaaa atgtcatgag tttcgccaat 84 0 
aatattcata cacatgaagg tggaacgcat gaacaaggtt tccgtacagc cttgacacgt 900 
gttatcaatg attatgctcg taaaaataag ttactgaaag acaatgaaga caacctaaca 960 
ggggaagatg ttcgcgaagg cttaactgca gttatctcag ttaaacaccc aaatccacag 1020 
tttgaaggac aaaccaagac caaattggga aatagcgaag tggtcaagat taccaatcgc 1080 
ctcttcagtg aagccttctc cgatttcctc atggaaaatc cacagattgc caaacgtatc 1140 
gtggaaaaag ggattttagc tgccaaggct cgtgtggctg ccaagcgtgc gcgtgaagtc 1200 
acacgtaaaa aatctggttt ggaaatttcc aaccttccag ggaaactagc agactgttct 1260 
tctaataacc ctgctgaaac agaactcttc atcgtcgaag gagactcagc tggtggatca 1320 
gccaaatctg gtcgtaaccg tgagtttcag gct'atccttc caattcgcgg taagattttg 1380 
aacgttgaaa aagcaagtat ggataagatt ctagctaacg aagaaattcg tagtcttttc 1440 
acagccatgg gaacaggatt tggcgcagaa tttgatgttt cgaaagcccg ttaccaaaaa 1500 
ctcgttttga tgaccgatgc cgatgtcgat ggagcccaca ttcgtaccct tcttttaacc 1560 
ttgatttatc gttatatgaa accaatccta gaagctggtt atgtttatat tgcccaacca 1620 
ccaatctatg gtgtcaaggt tggaagcgag attaaagaat atatccagcc gggtgcagat 1680 
caagaaatca aactccaaga agctttagoc cgttatagtg aaggtcgtac caaaccgact 1740 
55 attcagcgtt' ataaggggct aggtgaaatg ■ gacgatcatc agctgtggga aacaaccatg 1800 
gatcccgaac atcgcttgat ggctagagtt tctgtagatg atgctgcaga agcagataaa 18 60 
atctttgata tgttgatggg ggatcgagta gagcctcgtc gtgagtttat cgaagaaaat 1920 
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gctgtctata gtacacttga tgtctaa 
<210> 61 

<211> 267 / 1 i 

5 <212> DNA 1 ' 

<213> Streptococcus pneumoniae 
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<400> 61 

atgggattta ctgaagaaac agtacgtttt aaattggacg attccaataa aaaagaaatt' 60 

agcgaaactt tgacagatgt ttatgcttcg ttgaacgata' agggttacaa cccaattaac 120 

caaatcgtag gttacgtatt ( gagtggagac cctgcc^acg ttcctcgtta tiaataatgca 180 

cgaaatcaaa tccgtaagta tgagcgtgat gaaatcgttg aggaattggt tcgctactac 240 

ctcaaaggac aaggagtcga tctataa ' 1 i , 267 

<210> 62 1 j 

<211> 597 - t ' 

<212> DNA ' , ' 

<213> Streptococcus pneumoniae ( 

<400> 62. ' ■ 

atggtcaact atccacataa agtttcatca caaaaaagac aaacatc^ct ttctcaaccc 60 
aaaaatttcg caaatcga'gg aatgtctttt gaaaagatga tcaatgctac caacgactac 120 
tatttgtctc agggcttggc tgttatacat aagaaaccaa ctcctattca aatcgtacaa 180 
gtggactatc cacaacgaag tcgtgccaag attgttgaag 'cctattttcg acaagcttca 240 
acgacggact attctggcgt ttataatgga tattacatcg actttgaagt- caaggaaaca 300 
aaacaaaaac gtgcgattcc gatgaaaaat tttcatccac atcagattca gcatatggaa 360 
caagtccttg cccaacaagg aatctgcttt gtccttcttc acttttcttc tcagcaagaa 420 1 
acctacttat tgccggcatt cgatttgatt ( cgcttctatc atcaagataa gggacaaaaa 4 80 
tcaatgccac ttgaatatat tcgagaatat ggatatgaaa tcaaggctgg tgccttccct 540 
caaattcctt atctcaatgt tatcaaagaa catttattag gtggtaaaac aagatga 597 

<210> 63 
<211> 867 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 63 

atggctctat ttagtaaaaa agataagtat attcgaatca atcccaatcg ttcggttagg 60 
gaaaaacctc aagctaagcc agaggttcca gatgaattat tttcccagtg tccaggctgt 120 
aagcatacca tctatcagaa ggatctggga agtgaacgta tctgtccgca ctgtagctat 180 
acctttcgta tttctgccca agaacgcttg. gctttgacga ttgatatggg aaccttcaaa 240 
gaattgttta cagggattga aagcaaggat cccttgcatt tccctggtta ccaaaagaaa 300 
ctggcatcta tgcgtgaaaa aacaggtctg catgaagccg ttgtgacagg aactgctctt 360 
attaaaggtc agactgtggc tcttgggatt atggattcta actttatcat ggcttctatg 420 
ggtacggttg taggtgaaaa aatcactcgt ttgtttgagt atgcgactgt cgaaaaattg 4 80 
ccagttgtcc tattcacagc ctctggtgga gcccgtatgc aggaaggaat catgagtctc 540 
atgcagatgg ptaagatctc tgcggcggtt aaacgccatt caaatgctgg tctcttttac 600 
ctgaccattt tgacagatcc aacgactggt ggtgtgacag cttctttcgc tatggaaggc 660 
gatatcattc tggctgaacc acagagcttg gttggttttg ctggacgtcg tgtgattgaa 720 
aatacggttc gtgaaagctt gcctgaggat ttccaaaagg cagaattcct attagaacat 780 
ggctttgtgg atgctattgt caaaagaaga gacttaccag atacgattgc tagcctagtc 840 
agattgcatg gagggagtcc tagatga • 8 67 

<210> 64 
<211> 420 
<212> DNA 

<213> Streptococcus pneumoniae 
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<400> 64 

atgagaatta tgggattgga cgtcggttca aaaacggtag gggtggcgat tagcgatccg 60 
cttggtttta cagctcaagg gcttgaaatc atccagataa, atgaagaaca' aggccaattt 120 
ggttctgacc gcgttaagga attggttgat acttacaagg tggaacgatt tgtagtgggc 18o' 
ttgcctaaaa acatgaacaa tacaagtgga ccgcgcgtag 1 aagctagtca agcatacgga ,240 
gcaaagctag aagagttttt tggtttacca .gtagactatc aggatgaacg cttgacaaca 300 
gtggctgctg agcgcatgtt gattgaacaa gcagatatca gtcgcaataa gcgcaagaaa 360 
gtcattgata agttagcagc tcagctgatt ttacaaaatt atttagatag aaaattttaa 420 

<210> 65 ' • r , 

<211> 1197 
<212> DNA 

<213> Streptococcus pneumoniae 

t 1 
<4 00> 65 

atggcaaaac ttactgttaa agacgttgac ttgaaaggta aaaaagtcct cgttcgtgttt 60 
gacttcaafcg taccattgaa agatggcgtla atcactaacg ataaccgtat cacagcagct 120 
cttccaacta ttaagtacat catcgaacaa ggtggacgtg caattctttt ctctcacctt 180 
ggacgtgtga aagaagaagc tgataaagct ggtaaatcac ttgctcctgt ! agcagcagac 240 
ttggcagcaa aacttggtca agatgttgtt* ttcccaggtg tcactcgtgg tgctgaattg 300 
gaagcggcaa tcaacgctct tgaagatgga caagttctct tggttgaaaa cactcgttac 360 
gaagatgttg acggcaagaa agaatctaaa aacgatcctg aacttggtaa atactgggca 420 
tcacttggag atggtatctt cgtaaacgajt <gcattcggta cagctcaccg tgcacacgca 480 
tctaacgttg gtatctcagc aaacgttgak aaagcagttg ctggtttcct tcttgaaaac ^40 
gaaattgcct acatccaaga agcagttgaa actccagaac gtccattcgt ggctatcctt 600 
ggtggttcaa aagtttcaga caagatcggt gttatcgaaa acttgcttga aaaagctgat 660 
aaagtcctta tcggtggtgg gatgacttac acattctaca aagcacaagg tatcgaaatc 720 
ggtaactcac ttgtagaaga agacaaattg gatgttgcga. aagctcttct tgaaaaagca 780 
aatggtaaat tgatcttgcc agttgactca aaagaagcta acgcatttgc tggttacact 840 
gaagtgcgtg acactgaagg tgaagcagtt tctgaaggct tccttggtct tgacatcggt 900 
ccaaaatcta tcgccaaatt tgacgaagct ttgactggtg ccaaaacagt tgtatggaac 960 
ggacctatgg gtgtatttga aaacccagat ttccaagctg gtacaatcgg tgtgatggac 1020 
gctatcgtga aacaaccagg agttaaatca atcatcggtg gtggtgactc agctgccgca 1080 
gcgattaacc ttggccgtgc agacaagttc tcatggatta gtacgggtgg tggagcatca 1140 
atggaacttc ttgaaggtaa ggttcttcca caacttgcag ccttgacaga aaaataa 1197 

<210> 66 
<211> 498 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 66 ' 

atgttaaaat cagaaaaaca atcacgttat caaatgttaa atgaagaatt gtccttccta 60 
ttggaaggcg aaaccaatgt tttggctaat ctttccaacg ccagtgctct cataaaatca 120 
cgttttccta ataccgtatt tgcaggcttt tatttgttcg atggaaagga attggtttta 180 
ggccccttcc aaggaggtgt ttcctgcatc cgtattgcac taggcaaggg tgtttgtggt 240 
gaggcagctc actttcagga aactgttatt gttggagatg tgacgaccta tctcaactat 300 
atttcttgtg atagtctagc taaaagtgaa attgtggtgc cgatgatgaa gaatggtcag 360 
ttacttggag ttctggatct ggattcttca gagattgagg attacgatgc tatggatcga 420 
gattatttgg aacaatttgt cgctattttg cttgaaaaga cagcatggga ctttacgatg 480 
tttgaggaaa aatcttaa 4 98 

<210> 67 
<211> 630 
<212> DNA 

<213> Streptococcus pneumoniae 
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<400> 67 . , ' ' 

atgacaatcg aactattgac tccctttacc aaggtagagt tggagccaga' aatcaaggag |50 

aaaaaacgca-'aacaagttgg gattttaggg gggdatttta accctgttc^ ! caatgcccat 120 

ctcattgttg cggatcaagt acggcaacag ttgggactgg dbcaagttct gctcatgcct 180 

gaataccaac ctcctcacgt tgataaaaag gaaaccatcc' ctgaacacca tcgtctcaag 240 

atgcttgagt tggcaattga gggaattgac ggcctagtca ttgaaaccat tgagttggag 300 

cgcaagggta tttcctacac ctacgacacc atgaagattt tgacagagaa gaatccagat 360 

acggattatt actttatcat cggtgccgac atggttgact atctgcctaa gtggtaccga 420 

attgatgaac tggttgac^t ggttcagttt gtgggggttc agcgtccacg ctacaaggta 480 

gggacttcct atccagttat ,-ctgggtggac gtaccgctca tggatatctc gtccagcatg 540 

gtgcgtgcct tccttgccca aggtcggaaa cccaactttc tcctacctca gccagtgcta 600 

gactacatcg . a f gaagcjaggg gctctactga'- ' 1 - 630 

<210> 68 ' i ' > 

<211> 768 . 
<212> DNA ' , 

<213> Streptococcus pneumoniae ■ , 

<400> 68 - ' 

atgaatattg ( caaaaatagt cagagaagcg cgtgagcaga gtcgcttgac aaccttggac 60 

tttgcgacag gcatttttga tgaatttatc caattacatg gtgaccgttc ttttcgtgat 120 

gatggtgcag 'ttgttggtgg tattggttgg cttggagacc aagctgtaac agtggttggt 180 

atccaaaaag gcaagagttt gcaagacaac ctcaaacgga attttggcca accacatcca 240 

gaaggctacc gaaaggcact gcggttgatg aaacaggctg agaaatttgg ccgtccagtt 300 

gtgaccttta tcaatacagc aggtgcttat cctggtgtcg gagcggaaga acgtggtcaa 360 

ggggaagcta ttgctcgcaa tctcatggaa atgagtgacc tgaaagttcc tattatcgcc 420* 

attattatcg gtgaaggtgg ttcaggcggg gctctggctc tagctgtcgc ggaccgtgtc 480 

tggatgctgg aaaattctat ctatgccatt ctcagtccag aaggctttgc ttccatttta 540 

tggaaggacg gtactcgcgc catggaagca gcagaactga tgaaaatcac ttcgcatgaa 600 

ctgttagaaa tggacgtggt ggataaggtg atttctgaag taggactttc tagtaaagaa 660 

ctgattaaga gtgtcaaaaa agaactccaa acggagctgg ctagactttc acaaaaaccg 720 

ctagaagagt tgctggaaga acgctatcaa c'gatttagaa aatactaa 768 

<210> 69 
<211> 510 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 69 

atgattataa aagtagaaat ggcagatgtt gaggtgttgg ctaaaattgc caaacaaacc 60 
tttcgtgaaa cctttgcgta tgataatacg gaagagcagt tacaggaata ctttgaagag 120 
gcttatagtc tgaaaacttt gtcaactgag ttgggaaatc ctgactctga aacctatttc 180 • 
attatgcatg aggaggagat agctggtttt ctcaaagtca actggggaag tgctcaaact 240 
gagagagaat tagaggacgc ttttgaaatt caacgcctct atgtgctaca aaaattccaa 300 
ggatttggac taggtaagca actgtttgaa ttcgcacttg aacttgctac aaaaaatagt 360 
ttttcttggg ct'tggctagg tgtttgggag cataatacaa aagctcaagc cttttataat 420 
cgatatggtt ttgaaaaatt tagccaacat cattttatgg ttggtcaaaa agtagatacg 480 
gattggttac tgagaaagaa attaaggtaa * * 510 

<210> 70 
<211> 1590 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 70 

atgttacggg ggactgcttt gctaacggct agtaacttta. tcagtcgcct actcggggct 60 
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gtttacatta, tcccttggta catctggatg ggggcttatg cagctaaggc aaatggtctc 120 
tttaccatgg gttacactat ctatgcttgg ttcttgttgg : tttcaacagc ggggattcca 180 
gttgcggtgg ccaagcaagt tgccaagtat aataccatgc gagaagaaga gcatagcttt 240 
gccctgattc ggagcttctt aggctttatg acaggactag, gcctggtttt' tgctttagtc 300 
ttgtatgtct ttgctccttg gctagcagac ttgtctggcg tgggcaaaga cttgatccca -360' 
atcatgcaaa gcttggcttg gggagtcttg attttcccgt < ctatgagtgt tatccgagga ,420 
tttttccaag ggatgaataa cctcaaaccc tatgccatga gccaaattgc tgagcaggtc 480 
attcgtgtta tctggatgct cctagcaacc tttatcatta tgaagctcgg ttcaggagat 540 
tatctagcag ccgttaccca atcaaccttt gctgcctttg .tcggtatggt agccagtttt 600 
gcagtcttga tttatttcct tgcccaagaa agttcactca 1 aaagagtctt tgaaac^gga 660 
gataagatta acagtaagcg tctcttggtt gataccatta aggaagccat tccttttatc 720 
ctgacagggt ctgccatcca gatcttccag attttggatc agctgacctt t'atcaatagt 780 
. ' atgagctggt ttaccaacta cagcaatgag gacttggttg tcatgttttc ttatttctca ' 840 
gccaatccta ataaaatcac gatgattttg atttctgtag gggtttcgat tgggagtgtt 900 
15 ggtttgccac ttttgacgga aaacta'tgt^c aagggggact tgaaagcggc ttctcgtctc 960 
gttcaggaca gtctcaccct actctttatg ttcttgctac cagcaacggt tggagtggtt 1020 
atggtaggag aacctcttta tacggtcttc tatggtaagc cagatagttt ggctctggdjc' 1080 
ttatttgtct ttgcagtttt gcagtctatt attttaggct tgtacatggt cttgtctcca 1140 
atgcttcagg ccatgttccg caaccgcaag gccgttctct attttatcta^ tggttctatt 1200 
gccaagctag tcttgcaact acctaccatc gccctcttcc acagttatgg ' tcctttgatt 1260 
tcaacaacca ttgctctcat cattcctaac gtcttgatgt atcgggatat ttgtaaagta 1320 
actggtgtca agcgcaaggt gattttgaag cgaaccattt taatcagttt gctgacccta 1380 
gtcatgtttc tgttaatagg aaccatccag tggctgttag gatttttctt ccaaccaagt 1440 
ggacgtttgt ggagcttctt ttatgtagqt cttgtcggtg ccatgggggg tggactttat 1500 
25 atggttatga gtctgcgtac ctatttatta gataaggtaa taggaaaagc ccaagcagat ^L560 
•cgcctgcgag caaaatttaa gctttcgtaa 1590 

<210> 71 
<211> 468 
30 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 71 

atgtcagata agattggctt attcacaggc tcatttgatc cgatgacaaa tgggcatctg 60 

35 gatatcattg aacgggcgag cagactcttt gataagctct atgtcggtat tttttttaat 120 

ccccacaaac aaggatttct tcctatcgaa aatcgtaaac gggggctaga aaaggctttg 180 

ggacatctgg aaaatgttga agtcgtggct tctcatgatg aattggtggt cgatgttgca 240 

aaaagattgg gtgctacttg tctagtgcgt ggtttgagga atgcgtcgga tttgcaatat 300 

gaagccagtt ttgattacta caatcatcag ctgtcttctg atatagagac tatttattta 360 

catagtcgac ctgaacatct ctatatcagt tcatcaggcg ttagagagct tttgaagttt 420 

ggtcaggata ttgcctgcta tgttcccgag agtatttgga ggaaataa 4 68 



40 



<2i0> 72 
<211> 432 
45 <212> DNA 

<213> Streptococcus pneumoniae 



50 



<400> 72 

atgacgattt tgtttgtggt tatcagtgct tcctttctgt atatggtttc tcttagcatg 60 
aaaccctatc aaacagctaa aagtgaagga gaaaaattag ctcagcagta tgcaggatta 120 
gagcaggccg atcaggttga tttatacaat ggcttggaat cttattacag cgttcttggt 180 
cgtaataaac agcaagaagc • acttgctgtt ctgattggaa aagatgatca taagatttac 240 
gtttatcagc taaatcaggg tgtttcacaa gaaaaagcag aaacggtttc taaggaaaag 300 
ggagctggcg aaattgacaa gattatcttt ggtcgttatc aagataagcc aatctgggaa 360 
55 gtcaagtcag gatctgattt ttatctagta gattttgaaa caggagcatt ggtcaacaag 420 
gagggcctat ga 432 
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<210> 73 
<211> 732 

<212> DNA , , ■ . • 

<213> Streptococcus pneumoniae , , 1 , ' 1 ■ ' ' '' , , 

•<400> 73 < > ' ' / ■ ' 

atgattgata ttcattcgca cattgtcttt gatgtagatg atggtcccaa gtcaagagag 60 

gaaagtaagg ctctcttgac agaagcctac aggcaggggg tgcgaaccat tgtctctacc 120 

tctcaccgtc gcaagggcat gtttgaaact ccagaagaga agatagcaga aaactttett 180 

caggttcggg aaatagctaa ggaagtcgcg agtgacttgg tcattgctta tggggctgaa 240 

atttactaca cgccagatgt tttggataag ctggaaaaca atcggattcc, gaccctcaat 300 

aa'tagtcgtt atgccttgat agagtttajgt atgaacactc cttatcgcga tattcatagt 360 

gccttgaata aaatattgat gttgggaajtt actcccgtca ttgcccacat ' agagcgctat 420 

gatgttcttg aaaataatga aaaacgcgtt 1 cgagagctga tcgatatggg ctgttacacg 480 

caaataaata gttcacatgt cctcaaatcc aaactttttg gagaacctta ta'aattcatg 540 

aaaaaaagag cgcagtattt cttggagcgt gatttggttc atatcattgc aagtgatatg 600 

cataatgtgg acggcagacc cccccatatg gcagaagcat atgaccttgt ttcccaaaaa ' 660 

tacggagaag cgaaggctca ggaacttttt atagacaatc ctcgaaaaat tgtaatggat 720 

caactaattt ag t 732 



<210> 74 
<211> 927 
<212> DNA 

<213> Streptococcus 



pneumoniae! 



<400> 74 

atgtctacaa tcgataaaga aaaatttcag tttgtaaaac gtgacgattt tgcctctgaa 60 

actattgatg cgccagcata ttcttactgg aaatcagtgt ttaaacaatt tatgaagaaa 120 

aaatoaactg tagtcatgtt gggaatcttg gtagccatca ttttgataag tttcatctac 180 

ccaatgtttt ctaagtttga tttcaatgat gtcagcaagg taaacgactt tagtgttcgt 240 

tatatcaagc caaatgcgga gcattggttc ggtactgaca gtaacggtaa atcgctcttt 300 

gacggtgtct ggttcggagc tcgtaactcc atcctcattt ctgtgattgc gacagtgatt 360 

aacttaatta tcaatatttt tatcaqtaat atttqaqata tttcaaaatc aqttaaccat 420 



cttggtta tcggtgtttt tgtcggtggt atttggggta tttcaaaatc agttgaccgt 
gtcatgatgg aagtttacaa cgtcatctca aacatcccac ctcttttgat tgttattgtc 4 80 
ttgacttact caatcggagc tggattctgg aatctgattt ttgccatgag cgtaacaaca 540 
tggattggta ttgccttcat gatccgtgtg caaatcttgc gctatcgtga cttggaatac 600 
aacttggcgt cacgt-acttt gggaacacca accttgaaga ttgttgccaa aaatatcatg .660 
cctcaattgg tatctgttat tgtgacaacc atgactcaaa tgcttccaag ctttatctca 720 
tacgaagcct tcttgtcttt cttcggtctt ggattaccga ttacagtgcc aagtttgggt 780 
cgtttgattt cggattattc acaaaacgta acaaccaatg cttacttgtt ctggattcca 840 
ttgacaaccc ttgtcttggt atccttgtcc cttttcgtag ttggtcaaaa cttagcggat 900 
gctagtgatc cacgtacaca tagatag 927 

<210> 75 
<211> 234 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 75 

* 

atgtataacc tattattaac cattttatta gtattatctg ttgtgattgt gattgcaatt 60 
ttcatgcaac caaccaaaaa ccaatccagc aatgtatttg atgccagttc aggtgatttg 120 
tttgaacgca gtaaagctcg cggttttgaa gctgtaatgc agcgtttgac agggatttta 180 
gtctttttct ggctagccat tgccttagca ttgacggtat tatcaagtag ataa 234 

<210> 76 
<211> 1110 
<212> DNA 
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<213> Streptococcus pneumoniae 
<400> 76 

atgtttcgta gaaataaatt att ( tttttgg accacagaaa tttfcactctt , aaccatcatc 60 
ttttacctat ggagacagat gggatctttg attaaccctt ttgttagcgt gcttaataca 120 
attatgattc catttttatt agggggcttt ctttattatt igacaaaccc tattgttact 180 
ttcttaaata aagt'ctgtaa actcaatcgt ttgcttggta' ttttaattac cttgtgtact 240 
ttggtctggg gaatggtcat aggtgttgtc tatctcttac ctattttgat taatcagtta 300 
tctagtttga ttatatctag tcaaactatt tatagtcgag tacaagactt aatcatagac 360 
ttatctaatt atcctgcgct ccagaatttg gatgtagaag ctacaattca gcagttaaac 420 
ttatcctatg ttgatattct tcaaaatatc ctaaatagcg tatcaaatag tgtggggagc 480 
gtcttgtcag ctcttatcag> tactgttttg attttgatta tgactcbagt ttttttggtt 540 
tatttcttat tagatggaca taaattcttg cccatgcttg aaagaacgat tctaaagagg 600 
gatcgcttgo atattgcagg cttattaaag' aatttaaatg cgacgattgc tcgctatatt 660 
agtggagttt cgattgacgc aatcattata ggttgtttgg . cttatattgg ctatagtatt 720 
attggtttaa aatatgbttt . agtttttgcc attttttctg' gtgtagccaa tttaattcct 780 
tatgtggggc caagtattgg tttgattcct atgatca.tcg ' caaatatatt cactgtaccc 840 
catagactgc tgattgcagt gatttatatg cttgttgttc agcaggtaga tggcaatatc 900 
ttatatcctc gaattgtagg aagtgttatg aaggttcatc caatcacgat tttagtttta 960 
cttttgttgt caagcaatat ctatggtgta gttggaatga ttgtcgcagt gccaacctat 1020 
tctatcttga aagaaatttc taagttctta tcccgtttgt atgaaaatca taaaataatg 1080 
aaagaacgagi aaagagaatt agctaagtaa t 1110 

<210> 77 
<211> 1356 
<212> DNA 

<213> Streptococcus pneumoniae - , 

<400> 77 

atgtatcaag cactttatcg aaaatataga agtcaaaact tctcccagtt agttggtcaa 60 
gaagttgtgg ctaagactct taaacaagcg gtggagcaag agaaaataag tcacgcttat 120 
cttttttctg gtcctcgtgg aacgggaaaa accagtgttg ctaaaatctt tgccaaggct 180 
atgaactgtc ccaatcaagt gggtggcgaa ccttgcaata actgctatat ttgtcaagca 240 
gtgacggacg gtagtttaga agatgtcatt gaaatggatg cagcttctaa taatggggta 300 
gatgaaattc gcgaaattcg tgataaatct acctatgcgc ctagccttgc tcgttataag 360 
gtttatatca tagatgaggt tcacatgctg tctacagggg cttttaatgc cctcctaaag 420 
acgctggaag aaccaacaca gaatgtagtc tttatttt'gg ccactactga attgcacaag 480 
attcctgcta ctattctatc ccgtgtgcaa cgttttgagt ttaaatcaat taagacacag 540 
gatattaagg aacatattca ctatatctta gaaaaagaaa atatcagttc tgaaccagag 600 
gctgtggaaa tcattgccag acgggcggaa ggtggaatgc gggacgcctt gtctattttg 660 
gatcaagccc tgagtttgac acagggaaat gagctgacga ctgctatctc tgaagaaatt 720 
actggcacca ttagcctatc agccttggat gattatgtgg cggccttgtc tcaacaggat 7 80 
gttcccaaag. ctttgtcttg cttgaatctt ctttttgaca atggtaagag catgactcgt 840 
tttgtgaccg atcttttgca ctatttaaga gacttgttaa ttgttcaaac agggggagaa 900 
aatactcatc atagttcagt ctttgtagaa aatttggcac ttcctcaaaa aaatctgttt 960 
gaaatgattc gcttagcaac agtgaattta gcagatatta agtctagttt gcagcccaag 1020 
atttatgctg aaatgatgac cgtccgtttg gcggaaatca agcccgaacc agctctatca 1080 
ggagcggttg aaaatgaaat tgctacgctg agacaggaag ttgcccgtct caaacaagag 1140 
ctttctaatg caggtgcggt tcctaaacaa gttgcaccag ctcctagtcg accagctacg 1200 
ggcaaaacag tctatcgtgt cgatcgcaat aaagtgcaat ctatcttaca agaggccgtc 1.260 
gaaaatcctg atttaacacg tcaaaatcta attcgtttgc agaatgcatg gggagaggta 1320 
attgaaagtc taggtgggcc ggacaagctc tgctag 1356 

<210> 78 
<211> 1989 
<212> DNA 

<213> Streptococcus pneumoniae 
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<400> 78 

atgtttcgat taaccaataa gttagcggta tcgaacttga ttaaaaaccg caaactctac ,60 
tatccttttg cgctggctgt tctcttggca gtcactctca cctatctct;t' ttactctcta 120 
5 accttcaatc ctaagattgc ggaaatccgt ggaggaacaa ccattcaggc tacacttgga 180 
tttggtatgt ttgtcgtcac ccttigcgtca gccattatccj ttctctatgc qaatagtttt 240 
gtcatgaaga aacgttccaa ggaactagga atttatggca tgttgggctt ggagaagcgt 300 
catcttatca gtatgacctt taaggagtta gtggtatttg ggattctaac tgttggagcg 360 
ggtatcggta ttggagcctt gtttgacaag ttaattttcg ctttcctgct caaactaatg 420 
10 aaattgaagg ttgagctggt tgctaccttc cagacgaaag ttgtcattac agtgcttgtt, 480 
gtcttcggtt tgattttcct, aggcctcatg ttcctgaatg- cccttcgaat cgcccgtatg 54 0 
aatgccctcc agctctctcg tgagaaagct agtggagaga aaaaaggtcg cttccttcct 600 
ctccaaacca ( ttcttggttc cataagttta* gdjaattggct attatcttgc ccttacggta 660 
aaagatcctc ttacagcctt aacaaccttc ttcatagctg ttttactggt tatctttggg 720 
• acttatctct tgtttaatgc agggattacc gttttcctcc 1 aaatcttaaa gaaaaataag 780 
aaatactatt accaaccaaa taacctcata tctgtttcta acttgatttt ccgtatgaag 840 
aaaaatgcag ttggactagc aactatcgct attttgt'caa caatggtttt ggtaaccatg 900 
tcagcagcga caagcatttt caattccgca gaafcccttta aaaaagttct aaatcctcat 960 
gattttgggg tttcaggaca aaatgttgaa a'aagaagatt tggacaaact cttgagccag 1020 
20 tttgcaagtg acaatggtta taagattaaa gaaaaagaag tgtttcgtta cacttacttt 1080 
ggtgttgcga accaagaagg aaataagtta accttttttg aaaaaggaca aaatcgtgtc 1140 
caacccacaa cagttttcat ggtatttgac caaaaagatt atgaaaatat gactggtcaa 1200 
aaactgtctc tatcaggaaa tgaggtcggt ctctttgcca aaaatgacgg actgaaagga 12 60 
cagaaaactc taattctgaa tgatcatcaa ttttctgtaa aagaagaatt taataaagat 1320 
25 tttattgtca accatgtccc aaatcagttt aatattttga ctgctgatta caattacctt 1380 
gttgtacctg atttacaagc ctttttgaac caattcccag attcggatat ctataatcag 1440 
ttttacggtg gtatgaatgt aaatgtcagt gaagaagaac aactcaaggt cgctgaggag 1500 
tatgaaaact acctcaatca atttaatgct, caattagaca cagaaggtag ctatgtttat 1560 
ggtagcaatc tagcagatgc tagttctcag atgagtgccc tctttggtgg tgtcttcttt 1620 
30 atcggtattt tcctatccat tatctttatg gtcggaactg ttctggtcat ctactacaaa 1680 
caaatttctg aaggctacga agaccgtgaa cgctttatta tcttgcagaa agtcggtttg 1740 
gaccaaaagc aaatcaagca aaccatccac aaacaggttt taactgtttt cttccttcct 1800 
ttgctctttg ccttcataca tctcgccttt gcctaccata tgcttagcct gattttaaaa 1860 
gtgattggtg tactggatac gactatgatg ttgattgtga ccttgtctat ctgcgctatc 1920 
35 ttcctcatcg cctatgtgct gattttcatg attacttcaa gaagttatcg caagattgtg 1980 
caaatgtaa 1989 



<210> 79 

<211> 891 

40 <212> DNA 

<213> Streptococcus pneumoniae 
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<400> 79 

atgaaacaag atcaactaaa ggcttggcaa ccagctcagt ttgaccgttt tgtccgtatc 60 
ttagaacaag accagctcaa tcacgcctat ctcttttcag gtttctttgg aagcttggaa 120 
atggcgcaat ttttagctaa gagcctcttt tgtacggata aagttggcgt cttaccatgt 180 
gagaaatgcc gaagttgcaa gctgattgaa caggaagagt ttccagatgt caccttgatt 240 
aagccagtca atcaggtcat caagacagaa cgcattcggg aattggtggg acagttttct 300 
caagcaggga ttgaaagcca gcaacaggtc tttattatcg agcaagcgga taaaatgcat 360 
cccaacgcag ccaattctct gctcaaggtc atcgaagaac cccagagtga agtttatatt 420 
ttcttcttga ctagcgatga ggaaaagatg ttaccgacaa tccgaagtcg gactcagatc 4 80 
ttccacttta aaaagcaaga agaaaaactt atcttactct tagaacaaat gggacttgtt 540 
aagaaaaaag cgactctttt agctaagttt agtcaatcgc gagctgaagc agaaaagttg 600 
gctaatcagg caagtttttg gaccttggtc gatgaaagtg aacgcctgct gacttggtta 660 
gtagctaaga aaaaagaaag ttatctacag gttgccaaat tagccaactt ggcagatgat 720 
aaggaaaaac aggatcaggt tttacggatt cttgaagttc tctgtgggca ggacctcttg 780 
caggtaagag taagagtgat tctacaagat ttactagaag ctagaaaaat gtggcaagct 840 
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aatgtcagct ttcaaaatgc catggaatat ctggtcttga aagaaatata a 891 
<210> 80 

<2U> 615 , t a - - 

,<212> DNA • 1 ' , ■ ' ' , .' ■ . , " ' 1 
<213> Streptococcus pneumoniae . i 

<400> 80 ' . 

atgaattcat ttaaaaattt cttaaaagag tggggactgt tcctcctaat tctgtcatta 60 

ctagctttaa gtcgtatctt tttttggagc aatgttcgcg tagaagcjaca ttcca^ggat 120 

ccgaccctag cggatggcga aattctcttc gttgtaaaac accttcctat tgaccgtttt 180 

gatatcgtgg tggcccatga ggaagatggc aataaggaca ,tcgtcaagcg' cgtcjattgga 240 

atgcctggcg acaccattcg ttacgaaaat .gataaactct acatcaatga 1 caaagaaacg 300 

gacgagcctt atctiagcaga ctatatca'aa, cgcttcaagg atgacaaact ciaaagcact 360 

15 tactcaggca agggctttga "aggaaatajaa ggaactttct ttagaagtat cgctcaaaaa 420 

gcccaagcct tcacagttga tgtcaactac * aacaccaact ttagctttac tgttccagaa 480 
' ggagaatacc ttctcctcgg agatgaccgc ttggtttcga gcgacagccg ccacgta^gt 1 540 

accttcaaag caaaagatat cacagggg'aa gctaaattcc gcttctggcc aatcaccccjrt 600 

atcggaacat tttaa ' 615 



<210> 81 i ' 

<211> 987 i 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 81 | , 

atggtagtat ttacaggttc aactgttgaa gaagcaatcc agaaaggatt gaaagaatta 60 

gatattccaa gaatgaaggc tcatatcaaa gtcatttcta gggagaaaaa aggctttctt 120 

ggtctatttg gtaaaaaacc agcccaagtg gatattgaag cgattagtga aacgactgtt 180 

30 gtcaaagcaa atcaacaggt agtaaaaggc gttccgaaaa aaatcaatga ,tttgaacgag 240 

cctgtgaaga cggttagtga agaaaccgtt gaccttggtc atgtggttaa tgctattaaa 300 

aaaatagagg aagaaggtca aggtatttct gatgaagtca aggctgaaat cttaaaacat 360 

gaaagacatg ccagcactat cttagaagaa actggtcaca ttgagatttt aaatgaactt 420 

caaatcgagg aagcgatgag ggaagaagca ggcgctgatg accttgaaac tgagcaagat 480 

35 caaactgaaa atcaagactt gaaagagatg ggcttgaagg tcgagcaaag ttatgatatt 540 

gcccaggtgg ctacggatgt gactgcctat gttcaagcga ttgtggatga catggatgtt 600 

gaagctacac tttcaaatga ttataaccgt cgtagcatca atctacaaat tgacaccaac 660 

gaaccaggtc gtattatcgg ctaccatggt aaagtcttga aggccttgca actgttggct 720 

caaaattatc tttacaaccg ctattccaaa accttctacg ttacaatcaa tgtcaatgat T80 

40 tatgtcgaac accgtgcaga agtcttgcag acctatgcgc aaaaattggc gaatcgtgtt 84 0 

ttggaagaag gtcgcagtca taaaacagat ccaatgtcaa atagcgaacg caagattatc 900 

catcgtatta tttcacgtat ggatggcgtg actagttact ctgaaggtga tgagccaaat 960 

cgctatgttg ttgtagatac agaataa 987 

45 <210> 82 

<211> 1383 
<212> DNA 

<213> Streptococcus pneumoniae 
50 <400> 82 

atgtcaaatt, ttgccattat tttagcagcg ggtaaaggga ctcgcatgaa atctgatttg 60 
ccaaaagttt tgcacaaggt tgcgggtatt tctatgttgg aacatgtttt ccgtagtgtg 120 
ggagctatcc aacctgaaaa gacagtaaca gttgtaggac acaaggcaga attggttgag 180 
gaggtcttgg ctggacagac agaatttgtg actcaatctg aacagttggg aactggtcat 240 
55 gcagttatga tgacagaacc tatcttagaa ggtgtgtcag gacacacctt ggtcattgca 300 
ggagatactc ctttaatcac tggtgaaagc ttgaaaaact tgattgattt ccatatcaat 360 
cataaaaatg tggccactat cttgactgct gaaacggata atccttttgg ctatggacga 420 



32 



WO 01/49721 PCT/US00/35604 



attgttcgta atgacaatgc tgaggttctt cggtcattgt tgagcagaag gatgctacag 480 

attttgaaaa gcaaatcaag gaaatcaaca ctggtaacat acgtctttga caacgagcgt 540 

ttgtttgagg ctttgaaaaa tatcaatacc aataacgctc aaggcgaata ctatattaca 600 

gacgtcattg gtattttccg tgaaactggt gaaaaagttg gcgcttatac tttgaaaiga;b 66p 
5 tttgatgaaa gtcttggggt aaatgaccgt gtggcgcttg cgacagctga gtcagttatg 720 
cgtcgtcgca tcaatcataa acacatggtc aacggtgtta 1 gctttgtcaa tccagaagca. 780 

acttatatcg atattgatgt tgagattgct ccggaagttc aaatclgaagc caatgttatc 840 

ttgaaagggc aaacgaaaat tggtgctgag actgttttga caaacggtac ttatgta.gtg 900 

gacagcacta tcggagcagg agcggtcatt accaattcta tgattgagga aag'tagtgtt 960 

10 gcagacggtg tgacagtcgg tccttatgct cacattcgtd caaattcaag tctgggtgcc 1020 

caagttcata ttggtaactt tgttgaggtg aaaggatctt caatcggtga gaataccaag 1080 
gctggtcatt tgacttatat cggaaact.gt gaagtgggaa, gcaacgtbaa/ tttcggtgct, 1140 

ggaactatta cagtcaacta tgacggca'aa aacaaataca agacagtcat 1 tggagtcaat 1200 

gtctttgttg gttcaaattc aaccattatt gcaccagtag aacttggtga caattccctc 1260 

15 gttggtgctg gttcaactat tactaaagac gtgccagcag atgctattgc tattggtcgc 1320 
ggtcgtcaga tcaataaaga cgaatatgca acacgtcttc ■ ctcatcatcc taagaaccag 1380 

' tag | I i 1383 

j 

<210> 83 
20 <211> 936 
<212> DNA 

<213> Streptococcus pneumoniae! 

<4 00> 83 j 

25 atgtccaaga t.tctagtatt tggtcaccaa aatccagact cagatgccat cggatcatct , 60 

gtagcttttg, cctaccttgc aaaagaagct tacggtttgg atacggaagc tgttgccctt 120 

ggaactccaa atgaagaaac agcctttgtc ttgaactatt ttggtgtgga agcaccaqgt 180 

gttatcactt ctgccaaagc agagggggca gagcaagtta tcttgactga ccacaatgaa 240 

ttccaacaat ctgtatcaga tatcgctgaa gtagaagttt acggtgttgt agaccaccac 300 

30 cgtgtggcta actttgaaac tgcaagccca ctttacatgc gtttggagcc agttggatca 360 

gcgtcttcaa tcgtttaccg tatgttcaaa gaacatggtg tagctgttcc taaagagatt 420 

gcaggtttga tgctttcagg tttgatttca gatacccttc ttttgaaatc accaacaaca 480 

cacccaacag ataaaatcat tgctcctgaa ttggctgaat tggctggtgt aaacttggaa 540 

gaatatggtt tggcaatgtt gaaagctggt accaacttgg ctagcaaatc tgctgaagaa 600 

35 ttgattgaca tcgatgctaa gacttttgaa ctcaacggaa ataatgtccg tgttgcccaa 660 

gtgaacacag ttgacatcgc tgaagttttg gaacgccaag cagaaattga agctgcaatg 720 

caagctgcca acgaatcaaa cggctactct gactttgtct tgatgattac agatatcgtc 780 i 

aactcaaact cagaaatatt ggctcttggt gccaatatgg acaaggtcga agcggctttc 840 

aatttcaaac ttgaaaacaa tcatgccttc cttgctggtg ccgtttcacg taagaaacaa 900 

40 ■ gtggtacctc aattaactga aagctttaat acgtaa 936 

<210> 84 
<211> 678 
<212> DNA 
45 <213> Streptococcus pneumoniae 

<400> 84 

atgatttcaa agagattaga attggtagct tcctttgtgt cacagggggc tattttacta 60 

gatgtgggaa gtgaccatgc ttatctgcct atcgagttgg ttgagagagg ccaaatcaaa 120 

50 - agcgctattg caggtgaggt ggtggaaggt ccctatcagt ctgcggttaa aaatgttgag 180 

gctcacggcc taaaggagaa aatccaagtc cgtttagcca atggcttggc agcttttgaa 240 

gagactgacc aagtgtctgt cattaccatt gctggcatgg gtggtcgttt gattgctagg 300 

attttagaag aaggtttggg gaagttagct aatgtagagc gtttgatcct ccagcccaat 360 

aatcgtgaag acgacttgcg tatctggcta caggatcatg gattccagat tgtagcagaa 420 

55 agcatcttag aagaagctgg aaagttttat gagattttgg tggtggaagc aggacaaatg 4 80 

aagctatcag ccagtgatgt tcgctttggt cccttcttgt ccaaagaagt cagtccagta 540 

tttgtccaaa aatggcaaaa agaagctgag aagctagagt tcgccctcgg acaaatccca 600 
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i 1 

gaaaaaaatc tggaagaacg tcaagttcta gtagataaga ttcaagctat caagga,ggtg 660 

ctccatgtta gcaagtga , . : 1 678 

i 

+ H * * 

<210> 85 ii , , ' ; . 

<211> 486 ' , 1 , , ■ ' 

<212> DMA. ' 1 \ ' , , . 

<213> Streptococcus pneumoniae 1 ' . ■ •' 1 ' 

<400> 85 . * „ 

atgaatttaa acgatattaa agacttgatg actcaatttg accagtcaag tttgagagaa 60 
ttttcttata aaaatgggac ggatgagttg ca^tttagca agaatgaagc ,gagacctgtg 120 
cctgaagttg caactcaagt cgctccagca cccgttctag' caacaccgag ticcagtagct 180 
cctacatctg ctccagcaga gactgtagc^t craagaagttc cagctccagc tgaagcaagt 240 
gtggctagtgi agggaaatct tgtagagagt' ccacttgttg gagtggttta cttggctgct 300 
15 ggaccagata aacctgcctt cgttacagtt ggtgat'agtg, tcaaaaaagg tcaaacattg 360 
gtaattatcg aagccaitgaa. agtcatgaat gajaatcccag ctcctaagga tggtgtggta 420 
acggaaattc tcg.tctctaa cgaagaaatg gttgagtttg 1 gtaaaggatt ggtacgtatc 480 
aaatga 486 

20 <210> 86* 

<211> 1236 

<212> DNA i * f 

<213> Streptococcus pneumoniae 

25 <400> 86 

atgaaactaa atcgagtagt ggtaacaggt tatggagtaa catctccaat cggaaataca 60 
ccagaagaat tttggaatag tttagcaact gggaaaatcg gcattggtgg cattacaaaa 120 
tttgatcata gtgactttga tgtgcataat gcggcagaaa tccaagattt tccgttcgat 180 
aaatactttg taaaaaaaga taccaaccgt' tttgataact attctttata tgccttgtat 240 
gcagcccaag aggctgtaaa tcatgccaa't cttgatgtag aggctcttaa tagggatcgt 300 
tttggtgtta tcgttgcatc tggtattggt ggaatcaagg aaattgaaga tcaggtactt 360 
cgccttcatg aaaaaggacc .caaacgtgtc aaaccaatga ctcttccaaa agctttacca 420 
aatatggctt ctgggaatgt agccatgcgt tttggtgcaa acggtgtttg taaatctatc 480 
aatactgcct gctcttcatc aaatgatgcg attggggatg ccttccgctc* cattaagttt 540 
35 ggtttccaag atgtgatgtt ggtgggagga acagaagctt ctatcacacc ttttgccatc 600 
gctggtttcc aagccttaac agctctctct actacagagg atccaactcg tgcttcgatc 660 
ccatttgata aggatcgcaa tgggtttgtt atgggtgaag gttcagggat gttggttcta 720 
gaaagtcttg aacacgctga aaaacgtgga gctactatcc tggctgaagt ggttggttac 780 
ggaaatactt gtgatgccta ccacatgact tctccacatc cagaaggtca gggagctatc 840 
-aaggccatca aactagcctt ggaagaagct gagatttctc cagagcaagt agcctatgtc 900 
aatgctcacg gaacgtcaac tcctgccaat gaaaaaggag aaagtggtgc tatcgtagct 960 
gttcttggta aggaagtacc tgtatcatca accaagtctt ttacaggaca tttgctgggg 1020 
gctgcgggtg cagtagaagc tatcgtcacc atcgaagcta tgcgtcataa ctttgtacca 1080 
atgacagctg ggacaagtga agtatcagat tatatcgaag ctaatgtcgt ttatggacaa 1140 
45 ggcttggaga aagaaattcc atacgctatt tcaaatactt ttggttttgg aggccacaat 1200 
gcagttcttg ctttcaaacg ttgggagaat agataa 1236 

<210> 87 
<211> 1080 
50 <212> DNA 

<213> Streptococcus pneumoniae 

<400> 87 

atgaacatct atgatcaact acaagttgta gaagaccgtt atgaagaatt aggagaattg 60 

55 ctgagtgacc ctgatgtcgt ttcagacacc aagcgtttta tggagctttc aaaagaagaa 120 

gcttccaatc gtgacaccgt aatagcctac cgtgagtata aacaagtcct tcaaaatatc 180 

gtcgatgccg aagagatgat taaggaatca ggcgg'agatg cggacttgga agaattggcc 240 
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aagcaagaac tcaaagatgc caaggctgaa aaagaagaat atgaagaaaa actgaaaatt 300 
ttgctccttc caaaggatcc a'aacgatgac aagaatatca tccttgaaat ccgtggagca 360 
gctggtggag acgaagcggc acttttcgct ggagatttgc taactatgta ccaaaagtat 420 
gcggaagccc aaggttggcg ctttgaagtc atggaagcct ctatgaatgg tgtcggtggt 480 
5 tttaaagaag tggttgctat ggtttcaggt cagtctgtat actctaagct taagtatgaa 540 
tcaggtgccc accgtgtgca acgtgttcct gtgacagaaa gccaaggccg tgtticatact 600 
tcgacagcga cagttcttgt tatgccagaa gttgaagagg ttgaatacga cattgatcca 660 
aaagaccttc gtgtcgacat ctatcacgcc tctggtgctg gtggacagaa cgtcaataag 720 
gttgcgactg ccgttcgtat cgttcacttg ccaaccaata tcaaggttga gatgcaggaa 780 
10 gaacgtaccc agcagaagaa ccgcgagaag gc^atgaaga ttatccgtgc acgcgtcgct 840 
gaccactttg ctcagattgc tcaggatgaa caagacgctg agcgtaagtc ijacaatcggt 900 * 
actggtgacc gttcagaacg gatccgaact tataacttcc cacaaaaccg tgtcacagac 960 
cacc9tatcg | gcttgaccct ccaaaaacta' (! gatacgattt tgtctggtaa attggacgaa 1020 
gttgtggatg ccttggtgct ttatgaccaa acacaaaaac tagaagaatt aaacaaataa 1080 
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50 



55 



<210> 88 
<211> 1680 
<212> DNA 

<213> Streptococcus pneumoniae 



<400> 88 

atggcctaca ctcttaaacc tgaagaagtt ggtgtttttg ccatcggtgg tctaggagaa. 60 
atcgggaaaa acacttacgg aattgaatac caagacgaga ttatcatcgt cgatgctggg 120 
attaaattcc cagaagatga cttgcttggt atcgactatg tcattcctga ctactcttac 180 
25 atcgtagaca atatcgaccg cgtcaaggct gttttaatca cacacggaca cgaggaccac 24 0 
attggtggga ttccgttcct actcaagcaa gcaaatgtcc ctatttatgc tggaccgctt 300 
gccttggctt tgatccgtgg gaaactcgaa gaacacggcc tcttgcgcaa cgccaaactt 360 
tacgaaatca accacaacac cgagttgacc tttaaaaatc tcaaggcaac tttctttaga 420 
acgactcact ctattccaga gcctttgggg attgtcattc atactcctca agggaaaatc 480 
30 gtctgtacgg gtgactttaa gttcgacttt actccagttg gagaacctgc ggacttgcat 54 0 
cgtatggctg cgcttggtga agaaggcgtg ctctgtctcc tgtctgactc gacaaatgcg 600 
gaagtaccaa cctttaccaa ctctgaaaaa gtcgttggtc agtccattat gaagattatc 660 
caaggtattg aaggacgtat catctttgca tcctttgcct caaatatctt ccgtctccag 720 
caggcaacag aagctgctgt taagactgga cgcaagattg cggtctttgg tcgttctatg 780 
35 gaaaaggcca ttgtcaacgg aatcgatctt ggctacatca aagctcctaa gggaaccttt 840 
atcgagccaa atgaaatcaa agattatcct gcaggagaag ttcttatcct ctgtacaggt 900 
agtcagggtg agcctatggc agccctctct cgtatcgcca acggaaccca ccgtcaagta 960 
caattacaac caggtgatac cgttatcttc tcttctagtc ccatccctgg aaacactact 1020 
agcgtcaaca agctgattaa catcatttct gaagctggtg tcgaagttat ccacggtaaa 1080 
40 gtgaacaata tccatacatc tggacacggt ggtcagcaag agcaaaaact catgctctgc 1140 
ttgattaagc caaaatactt catgcctgtc cacggtgaat accgcatgca aaaagtccac 1200 
gctggactag cagtggatac tggtgttgag aaggacaata tctttatcat gagcaatggc 1260 
gatgtgcttg cccttactgc tgactcagct cgtatcgcag gtcatttcaa cgcccaagat 1320 
atctatgtcg atggaaatcg tatcggtgaa attggcgcag ctgtcctcaa agatcgtcgc 1380 
45 gatctatctg aagacggtgt cgttctggca* gtcgcaactg ttgacttcaa atcgcagatg 1440 
attctgtctg gcccagatat cctcagccga ggctttgtct acatgagaga gtctggagac 1500 
ttgattcgcc aaagccagcg tatcctcttc aatgccattc gtatcgcact gaaaaataag 1560 
gatgctagcg tgcaatctgt caatggtgcc attgtcaacg ctattcgccc cttcctctat 1.620 
gaaaataccg aacgtgaacc gatcatcatc ccgatgatcc tcacaccaga tgaagaataa 1680 



<210> 89 
<211> 1362 
<212>* DNA 

<213> Streptococcus pneumoniae 
<400> 89 

atggcagaag tagaagagtt acgagtacaa cctcaagata tcttagctga gcaatccgtt 60 
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ttaggggcta tctttattga tgagagtaaa cttgtttttg tgcgagaata cattgagtct 120 
cgggactttt ttaagtatgc ccatcgtttg attttccaag ccatggtcga tttatccgat 180 
cgtggtgatg ccatagatgc aacaacggtt cgtactatcc ttgataatca aggtgattta 240 
cagaatattg gtggcttgtc ttacttggtt gagattgtta. attctgtgcc aacttct^ct 36q 
aatgcggagt attatgctaa gattgttgca gaaaaagcaa tgctacgtbg tttaattgpc 360 
aagttgacag agtctgtcaa ccaagcttac gaagcgtcac 1 aaccagbtga tgaaattatt . 420 
gctcaggcag aaaaagggtt gattgatgtc agtgaaaatg caaatcgaag cgggtttaag 480 
aacattcgag atgtgttgaa tctcaacttt ggaaatctgg aagctcgctc gcaacaaacg 540 
accgatatta caggtattgc gacaggttat cgtgatttgg atcatatgac aacaggactt 600 
catgaggagg agttgattat cttagcagct cgtccagcag ttggtaagac agcatttgcc 660 
ttgaatatcg ctcagaacat tgggactaag ttggacaaaa cggttgctat tttttcactc 720 
gaaatgggtg cggaaagctt ggtagatc^t atgttagctg cagaaggctt ggtggagtca ( 780 
cattctatcc gtacagggca attgacagkt gaggagtggc aaaaatatac tattgctcag 840 
ggtaatctag ctaacgccag tatctatatc gatgatacgc caggtattcg gattacagag .900 
attcgttctc gttctcgtaa attggctcaa gaaactggaa atcttggttt gattgtgata 960 
gactatttgc aacttatcac gggaactggt cgagaaaatc gtcaacaaga agtttctgaa 1020 
atttctcgtc agttgaaaat actagccaag gaattgaagg ttccagtaat cgctctga^t 1080 
cagctttctc gtggtgtaga acaacgtcag gacaagagac cggtcttgtc tgatattcgt 1140 
gaatctgggt ctattgagca ggacgctgat atcgtagctt ttctctatcg cgatgactac 1200 
tatgaacgtg gtggtgaaga agaggagggt atcccaaata ataaggtgga agttattatc 1260 
gagaaaaacc gtagtggagc tcgtggaaca gtggaattga ttgtccaaaa agaatacaat 1320 
aaattcttca agtattcaag tatctcaaag agggaggcat aa 1362 

<210> 90 I , 

<211> 693 I 

<212> DNA 

<213> Streptococcus pneumoniae 



<400> 90 

atggcgtata aatatttagt gattgtagaa tcacctgcaa aagccaagac aattgagaaa 60 

tatcttggac gaaactataa agtaatggcc agtgttggcc atataagaga tttaccgaaa 120 

agtaaaatgg gtatcgattt tgaaaacaat tatgaacccc attatatttc tatacgcgga 180 

aaaggcgatg tcatcaaaag cctgaaagcc gcagcaaaaa aagctcaaaa agtttacttg 240 

gcaagtgacc cggatagaga aggagaagcg attgcttggc atttagcgta cctacttggg 300 

ttggatctga aagaaaaaaa tcgggtggtc ttcaatgaaa tcacaaaaga cgcagtcaaa 360 

gcagctttta aggaaccaag aacgatcgat gtagatttag tagatgcaca gcaagctcgt 420 

cgtaccttag acagaatcgt tggttattcg atcagtccta ttctctggcg taaggtcaag 480 

aaagggttaa gtgcaggacg tgtccaatct gtcgctttaa aaattattat tgaccgtgaa 540 

aaagagatcc gagaatttgt tccagaagaa tattggagca tcgacggtaa ttttaaaaaa 600 

gctcgcaaga aattcaaagc aaatttctgg ggaatcgacg gtaagaaaaa gaaattacca 660 
gatgcacaaa gtgtaaaaag aagtcactgc tag 693 

<210> 91 ' 
<211> 981 
<212> DNA 

<213> Streptococcus pneumoniae 



60 



<400> 91 

atgtttattt ccatcagtgc tggaattgtg acatttttac taactttagt aggaattccg 

gcctttatcc aattttatag aaaggcgcaa attacaggcc agcagatgca tgaggatgtc 120 

aaacagcatc aggcaaaagc tgggactcct acaatgggag gtttggtt-tt cttgattact 180 

tctgttttgg ttgctttctt tttcgcccta tttagtagcc aattcagtaa taatgtggga 240 

atgattttgt tcatcttggt cttgtatggc ttggtcggat ttttagatga ctttctcaag 300 

gtctttcgta aaatcaatga ggggcttaat cctaagcaaa aattagctct tcagcttcta 360 

ggtggagtta tcttctatct tttctatgag cgcggtggcg atatgctttc tgtctttggt 420 

tatcaagtgc atctagggat tttctatatt gttttcgctc ttttctggc.t agtcggtttt 4 80 

tcaaacgcag taaacttgac agacggtgtt gacggtttag ctagtatttc cgttgtgatt 540 
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i 

agtttgtctg cctatggagt tattgcctat gtgcaaggtc agatggatat tcttctagtg 600 

attctggcca tgattggtgg tttgctcagt ttcttcatct ttaaccataa gcctgctaag 660 

atctttatgg gtgatgtggg aagtttggct ttaggtggaa tgctggcagc tatctctatg 720 

gctctccacc aagaatggac tctcttgatt atcggaattg tgtatgtttt tgaaacaact 78P 

5 tctgttatga tgcaagtcag ttatttcaaa ctgacaggtg gtaaacgtat tttccgtatg 840 

acgcctgtac atcaccattt tgagcttggg ggattgtctg gtaaaggaaa tccttggagc 900 

gagtggaagg ttgacttctt cttttgggga gttgggcttc tagcaagtct cctgacccta 960 

gcaattttat atttgatgta a 981 

10 <210> 92 

<211> 2082 
<212> DNA 

<213> Streptococcus pneumoniae 

i 

15 <400>92 

atggcacgcg aattttcact tgaaaaaact cgtaatatcg gtatcatggc tcacgtcgat 60 
gccggtaaaa caacaactac tgagcgtatt ctttactaca ctggtaaaat ccacaaaatc 120 
ggtgaaactc acgaaggtgc gtcacaaatg gactggatgg agcaagagca agaacgtggt 180 
atcacgatca catctgctgc gacgacagct caatggaaca accaccgcgt aaacatcatc 240 
gacacaccag gacacgtgga cttcacaatc gaagtacaac gttctcttcg tgtattggat 300 
ggtgcggtta ccgttcttga ctcacaatca ggtgttgagc ctcaaactga aacagtttgg 360 
cgtcaagcaa ctgagtacgg agttccacgt atcgtatttg ccaacaaaat ggacaaaatc 420 
ggtgctgact tcctttactc tgtaagcaca cttcacgatc gtcttcaagc aaatgcacac 480 
ccaatccaat tgccaatcgg ttctgaag&t , gacttccgtg gtatcattga cttgatcaag 540 
atgaaagctg aaatctatac taacgacctt ggtacggata tccttgaaga agacatccca . 600 
gctgaatacc ttgaccaagc tcaagaatac cgtgaaaaat tgattgaagc agttgctgaa 660 
actgacgaag aattgatgat gaaatacctc gaaggtgaag aaatcactaa cgaagaattg 720 
aaagctggta tccgtaaagc gactatcaac gttgaattct tcccagtatt gtgtggttca 780 
gccttcaaaa acaaaggtgt tcaattgatg cttgatgcgg ttatcgacta ccttccaagc 840 
ccacttgaca tcccagcaat caaaggtatt aacccagata cagacgctga agaaattcgt 900 
ccagcatctg acgaagagcc atttgcagct ct-tgccttca agatcatgac tgacccattc 960 
gtaggtcgtt tgacattctt ccgtgtttac tcaggtgttc ttcaatcagg ttcatacgta 1020 
ttgaatactt ctaaaggtaa acgtgaacgt atcggacgta tccttcaaat gcacgctaac 1080 
agccgtcaag aaatcgacac tgtttactca ggtgatatcg ctgctgccgt tggtttgaaa 114 0 
35 gatactacaa ctggtgactc attgacagat gaaaaagcta aaatcatcct tgagtcaatc 1200 
aacgttccag aaccagttat ccaattgatg gttgagccaa aatctaaagc tgaccaagac 1260 
aagatgggta tcgcccttca aaaattggct gaagaagatc caacattccg cgttgaaaca 1320 
aacgttgaaa ctggtgaaac agttatctca ggtatgggtg aacttcacct tgacgtcctt 1380 
gttgatcgta tgcgtcgtga gttcaaagtt gaagcgaacg taggtgctcc tcaagtatct 144 0 
taccgtgaaa cattccgcgc ttctactcaa gcacgtggat tcttcaaacg tcagtctggt 1500 
ggtaaaggtc aattcggtga tgtatggatt gaatttactc caaacgaaga aggtaaagga 1560 
ttcgaattcg aaaacgcaat cgtcggtggt gtggttcctc gtgaatttat cccagcggtt 1620 
gaaaaaggtt tggtagaatc t'atggctaac ggtgttcttg caggttaccc aatggttgac 1680 
gttaaagcta agctttatga tggttcatat cacgatgtcg actcatctga aactgccttc 1740 
aagattgcgg cttcactttc ccttaaagaa gctgctaaat cagcacaacc agctatcctt 1800 
gaaccaatga tgcttgtaac aatcactgtt ccagaagaaa accttggtga tgttatgggt 1860 
cacgtaactg ctcgtcgtgg acgtgtagat ggtatggaag cacacggtaa cagccaaatc 1920 
gttcgtgctt acgttccact tgctgaaatg ttcggttacg caacagttct tcgttctgca 1980 
tctcaaggac gtggtacatt catgatggta tttgaccact acgaagatgt acctaagtca 2040 
50 gtacaagaag aaattattaa gaaaaataaa ggtgaagact aa 2082 

<210> 93 
<211> 1227 
<212> DNA 
55 <213> Streptococcus pneumoniae 

<400> 93 
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i t 

atgccaaatt acaatattcc attttcaccg cctgatatca cagaagcaga aattgctgaa 60 
gtagcggata ccctgcgttc tiggttggatc acaacaggtc ctaaaacaaa agaactggag 120 
cgccgcttgt ctctttacac acagacacct aagactgttt gtctcaactc tgcgacagcc 180 
gctctggagt tgattttacg cgttttggaa gtgggacctg gtgatgaagt catcgttcca 240 
gccatgacct atacggcttc atgtagtgtc attacgcacg tgggagcaac ccctgtcatg 300 
gtggatatcc aagcagatac gtttigagatg gactatgacc tgcttgagca agctatcact 360 
gagaaaacta aggtgattat cccagtagag ctcgcaggga ttgtttgcga ttatgaccgt 420 
ttgttccaag tcgtggagaa aaaacgtgac ttctttaccg cttcaagcaa gtggcaaaag 480 
gcctttaacc gtattgtcat tgtctctgat agtgcccacg ctttgggatc tacttataaa 540 
ggacaacctt ctggttctat cgctgatttt acttccttct cattccatgc cgttaagaac 600 
tttacaacgg cagaaggtgg aagtgcgact tggaaagcca atccagtgat tgatgacgaa 660 
gagatgtaca aggaattcca aatcctttcc cttcacgggc aaactaagga tgctcttgcc 720 
aagatgcaac. tggggtcatg ggaatacgat atcgttacac cagcctataa gtgcaacatg 780 
accgatatca tggcttcact tggtttggta caattggacc gctatccaag tttgttgcaa 840 
15 cgccgtaagg acattgtgga ccgctatgat agtggttttg caggttctcg catccatcct 900 
ttggcacaca agactgaaac tgtcgaatct tcacgccacc tctacatcac ccgtgtagaa 960 
ggagcaagcc tagaagaacg cagcctcatc atccaagaat tggctaaagc aggaattgca 1020 
agtaatgttc actacaaacc gcttcctctc ttgacagcct ataagaatct tggatttgat 1080 
atgacgaact atcctaaggc ctatgccttc tttgagaatg aaattaccct ccctcttcat 1140 
actaaattaa gcgatgaaga agtagactat atcattgaga ctttcaaaac agtttctgaa 1200 
aaagtgctaa ctttatcaaa aaaatga 1227 



10 



20 



<210> 94 
<211> 978 
25 <212> DNA 

<213> Streptococcus pneumoniae 



30 



<400> 94 

atgacagaac ctgatttttg gaacgataat attgcggccc aaaaaacgtc gcaagaattg 60 
aatgtattta aaaacactta caataccttc cataagatgg aagagttgca ggatgaagtc 120 
gaaattttat tggatttttt ggctgaagac gagtcagtgc atgatgaact ggtagcgcag 180 
ttagccgaac ttgataagat- aatgaccagc tacgagatga ctttactctt gtcagaacct 240 
tatgaccaca acaatgccat cttggaaatc catccaggtt ctggtggtac tgaggcgcag 300 
gactggggtg atatgttgct tcgtatgtat actcgttatg gtaatgctaa aggctttaaa 360 
35 gtggaagtgt tggattacca agcaggtgat gaggctggta ttaagtcggt aactttatca 420 
tttgaagggc ctaatgccta tggtctcctc aagtcagaaa tgggtgttca ccgcttagtg 480 
cgaatctcac catttgactc tgccaaacgt cgccatacct ctttcacatc tgtagaagtg 540 
atgccagaat tggatgatac tattgaagtg gaaatccgtg aagatgatat caagatggat 600 
/in acc ttccgtt caggtggtgc cggtggacaa aacgtcaata aagtttcaac aggtgtacgt 660 
40 ttaacccaca ttccaactgg aattgttgtc caatcaacag tggatcgtac ccagtatgga 720 
aatagagatc gtgccatgaa gatgttgcag gctaagctct atcaaatgga gcaagagaag 780 
aaggctgcgg aggtagattc tctcaaaggt gagaaaaagg agattacttg gggaagccaa 840 
atccgttctt atgtcttcac gccttatact atggtaaaag atcaccgaac tagctttgag 900 
gttgctcagg tagataaggt tatggatggg gacctagatg gttttatcga tgcttatctc 960 
45 aagtggcgaa ttagctaa ^18 

<210> 95 
<211> 750 
<212> DNA 
50 <213> Streptococcus pneumoniae 

<400> 95 

atgttttata cttatttgcg tggattagtt gtattgctct tatggtccat caatggcaat 60 
gctcactatc ataatactga taaaattcct aatcaagatg aaaattatat tttagttgcg 120 
55 cctcaccgta cctggtggga tcctgtttat atggcctttg cgaccaagcc aaaacagttc 180 
atctttatgg caaaaaaaga actctttacc aaccgtatct ttggttggtg gattcgtatg 24 0 
tgtggcgcct ttcccatcga ccgtgaaaat cccagcgcct cagccatcaa atatcctatc 300 
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aacgttctca aaaaaagtga ccgctctctc atcatgtttc caagtggtag ccgccactca 360 
aacgatgtca aggggggcgc agcactgatt gccaaaatgg ccaaggtccg tatcatgccg 420. 
gttacctaca ccggtcccat gactttgaag ggcttgjatta gccgtgaacg tgtcgatatg 480 
aactttggaa atccaatcga tatctcagat atcaagaaaa tgaatgatga aggcattgaa 540 
5 acagtcgcca atcgtattca aacagaattc caacgtctgg acgaagaaac gaaacaatgg 600 
cacaatgata aaaaaccaaa tccactctgg tggtttatcc 1 gcatccctgc cqtcatcctt 660 
gctattatcc tcgctatcct aaccatcatc tttagcttta tcgcaagctt catctggaac 720 
ccagataaga aaagagaaga acttgcatag 750 

» .i * 

10 <210> 96 . ' , 

<211> 3102 
<212> DNA 

<213> Streptococcus pneumoniae ' 1 ' 

i 

15 <400> 96 ; 

ttgatcgcac aactagatac aaaaacagtc tatagtttta tggaaagcgt catttcgatc 60 
gaaaagtatg tgagagcagc taaagaatac ggctaca'ctc atttggctat gatggatatt 120 
gacaatcttt atggcgcttt cgactttcta gagattacaa aaaaatacgg cattcatcct 180 
ttgctagggc ttgaaatgac agtgtttgta gatgatcagg gagtaaattt gcgcttttta 240 
20 gctctatcta gtgtgggcta tcagcagttg atgaagcttt cgacagccaa gatgcagggg 300 
gagaaaactt ( ggtcagtcct gtcccagtac ctggaggata tcgcggtcat tgtgccttat 360 
tttgatagag ttgagtcgtt agaactaggc tgtgattact atataggggt ttatccagaa 420 
acactagcaa gcgaatttca tcatcctatc ttacctcttt atcgggtcaa cgcttttgaa 480 
agcagggata gagaagttct tcaagtttta acagcgatta aagaaaatct accgctcaga 54 0 
25 gaagttccct tgcgttcgag acaagatgtc tttatatcag caagttcttt agagaaacta 600 
ttccaagagc gttttccgca agctttggac aatttagaaa agcttatttc aggcatttct 660 
tacgacttgg atactagtct gaaactgcct cgttttaatc cagctagacc agcagtagag 720 
gagttgagag agcgtgctga actggggctt , gttcagaagg ggttgactag taaagaatat 780 
caagatagac tagaccaaga attgtctgtt attcatgata tgggctttga tgattatttc 840 

30 ttggttgttt gggatttgtt gcgttttgga cgatcgaatg gctattatat gggaatggga 900 
aggggttctg cagtaggcag tttggtttct tatgccttag acatcacggg gattgaccca 960 
gtagagaaaa atctgatttt tgaacgcttt cttaatcgtg aacgctatac catgcctgat 1020 
attgatattg atatcccaga tatttatcgt ccagatttta tcagatatgt tggtaataaa 1080 
tatggtagta aacatgcggc acaaatcgtt actttttcaa cctttggagc caagcaagct 1140 

35 cttcgagatg tcttgaaacg ctttggtgtg ccagagtatg aattatctgc aattactaag 1200 
aaaatcagtt ttcgtgacaa tcttaagtcg gcctatgagg gcaatctcca gtttcgtcag 1260 
caaatcaata gtaagttaga ataccaaaaa gcttttgaga ttgcttgcaa gatagagggc 1320 
tatccaaggc aaacctctgt ccatgcggct ggtgttgtaa ttagtgacca agatttaacc 1380 
aactacattc ctctaaagta tggtgatgaa attccactga ctcagtatga tgctcatgga 14 4 0 

40 gttgaggcta gcggactttt gaagatggac tttctgggac tacgaaattt gacctttgtc 1500 
cagaagatgc aagagttgct tgctgaaata gaaggtattc accttaaaat tgaagaaata 1560 
gatttggaag acaaagaaac gttagattta tttgcctctg gaaatacaaa aggtatcttt 1620 
caatttgagc aacctggtgc tattcgcttg ctcaaacgtg ttcaaccagt ctgttttgaa 1680 
gatgtcgtag caactacttc tctaaatcga ccgggtgcta gtgactatat caataatttt 1740 

45 gtggcaagaa agcatgggca ggaagaagtg actgttctgg atccagtact ggaggatatt 1800 
ttggctccaa cctacggcat aatgctctat caggagcagg ttatgcaggt tgcccagcga 1860 
tttgccggat ttagtcttgg gaaagccgat attttgcgtc gggctatggg gaaaaaggat 1920 
gcctctgcca tgcatgagat gagggcttcc tttattcaag gttcattaga agctggtcat 1980 
actgtggaaa aagcagagca ggtctttgat gttatggaga agtttgcagg ctatggtttt 2040 

50 aacaggtcac acgcctatgc ctactcagca ttggccttcc agttggctta ttttaaaaca 2100 
cattatccag ccatatttta tcagatcatg ttgaattctg ccaacagtga ttacttaata 2160 
gatgcacttg aagcaggttt tgaagtggcg cctctgtcca tcaacacgat tccetatcac 2220 
gataaaattg ccaacaaggc catctatcta ggtttgaaat ccattaaagg agtcagtaat 2280 
gatttagctc tctggattat tgaacataga ccttattcta atattgaaga ttttatagct 2340 . 

55 aaattacctg agaattatct gaaacttcct ctgctagaac ctttggtaaa agttggtctt 2400 
ttcgattcat ttgaaaaaaa tcgtcaaaaa gtatttaata acttagctaa tctatttgaa 2460 
tttgtgaaag agttgggaag tttgtttgga gatgctattt atagttggca ggaatcggaa 2520 
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gattggacgg aacaagaaaa attttatatg gaacaagagc ttttagggat .a.ggtgtcagc 2580 

aaacatccac tacaagctat tgcaagtaag gctatttacc cgattacccc aatcggaaat 2640 

ttgtcagaaa atagctatgc tattattttg gttgaagttc agaaaataaa agtgattcgt 2700 

aceaaaaagg gtgaaaatat ggccttctta caggcagatg atagtaagaa aaaattggat 27£0 

5 gtcactctct tttcagactt atatcgtcag gttggacagg aaataaaaga gggagdcttc 2820 

tactatgtaa aaggaaaaat acaatcacgt gatggccgtc tgcaaattgat tgcacaag.aa- 2880 

ataagagaag cagttgctga acgcttttgg atacaggtga aaaatcatga atcggatcaa 2940 

gaaatttcac gtattttaga acaatttaaa ggcccaatcc cagtcatcat ■ ccggtatgaa 3000 

gaggaacaga aaaccatcgt ttctccccat cattttgtag ctaaatccaa tgaattagag 3060 

10 gagaaattga atgaaatcgt tatgaaaacg atttatcgct aa , , 3102 

<210> 97 
<211> 921 
<212> DNA 
15 <213> Streptococcus pneumoniae! 

<400> 97 I , 

atgaccaacg aatttttaca ttttgaaaaa atcagccgcc agacttggca atctttacat 60 
cgaaagacaa cacctccttt gacagaagaa gaattggaat ctatcaagag ttttaatgac 120 
caaatcagtc tccaagacgt tacagatatc tatctcccct tggctcatct gattcagatt 180 
tacaagcgaa ctaaggaaga tttagccttt tcaaaaggaa ttttcctcca acgtgaaagt 240 
aaatctcaac cttttattat tggggtttct gggagtgttg ccgttggaaa atccacaacc 300 
agtcgcctac ttcaaatcct actgtcccgt acgtttacag atgctacggt tgagttggtt 360 
acaactgatg gttttctcta tcccaatcaa accttgattg agcagggaat tttaaatcgt 420 
25 aaaggatttc ctgaaagcta tgatatggia ! gctcttctca acttcttgga ccgcatcaaa ,480 
aatggacaag atgtagatat tcctgtctat tctcatgaag tttacgacat cgtacccaaa 540 
aagaaacaaa gtgtcaaagc tgctgatttt gtaatcgttg agggaattaa tgtctttcaa 600 
aatccacaaa acgatcgtct ctatatcact gacttctttg acttttccat ctatgtagat 660 
gctggagtgg atgatattga aagttggtat ctggaccgtt tcttgaaaat gctgagtcta 720 
gcccaaaacg accctgatag ctactattat cgttttactc agatgccgat tggggaagtg 780 
gagtcctttg cccatcaggt ctggaccagt atcaatctca caaatctgca aaattatatt 840 
gaaccaacca gaaatcgtgc agaagtgatt cttcataaaa gcaagaacca tgaaatcgat 900 
gaaatttact taaaaaagta a 92i 

35 <210> 98 
<211> 741 
<212> DNA 

<213> Streptococcus pneumoniae 
40 <400> 98 

atggaaattt cattattaac agatgttggt cagaaacgaa caaataacca agactatgtc 60 

aaccactatg tcaatagagc tggacgtacc atgattattt tagctgatgg gatgggaggt 120 

catcgtgcag ggaatatcgc tagt'gaaatg gcggtcacag acctgggtgt agcttgggtt 180 

gatacccaga tcgatacagt caatgaagtg cgtgaatggt tcgcccatta cctagaaatt 240 

gaaaatcaaa agattcacca gcttggtcag gatgaagctt acagaggcat gggaactact 300 

ttggaagtcc ttgctattat tgataatcag gctatctatg ctcatattgg tgattcgcgt 360 
atcggcttga ttcgtggaga agaataccat- cagttgacga gcgatcattc cttggttaat • 420 

gaattgctca aggctggtca attgacacca gaagaggcag aagctcatcc gcaaaaaaat 480 

attatcaccc agtctattgg gcaaaaagat gaaattcagc ctgattttgg gacagttatc 540 

cttgagtcag gtgactatct cttgctcaat agtgacggct tgaccaacat gatttcaggc 600 

agtgagattc gtgatattgt aaccagtgat attcctttag cagataaaac ggagacactt 660 

gttcgttttg ctaacaatgc aggaggttta gacaacatta cggttgccct tgtttctatg 720 

aacgaggagg atgaagaatg a* 74^ 

55 <210> 99 
<211> 831 
<212> DNA 
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<213> Streptococcus pneumoniae 



<400> 99 

, gtgacgatac agatgaagaa tacaggtaaa 
5 cagagtcaaa gggtcttgta tgaattgcga 
aatgatacca atccggatat tgtcatttcc 
tttcataagt acgaaaatca. gcttgacaag 
ttgggcttct atacagatta tcgtgatttt 
ctagatactg gggcaagggt ttcttaccct 

10 ggtgaagtta agattttcag agcactcaac 
atggtggcag atattgtaat aaatggtgtt 
acagtttcga caccgactgg tagtactgcc 
caccctacca ttgaagcttt gcaattaacg 
cgaacactgg gctcttccat tattgtgcct 

15 agaaacgatt atcatactat ttcggttgac 
cgtattgagt atcaaatcga ccatcataag 
agtttctgga accgtg.ttaa ggatgccttt 

<210> 100 
20 <211> 1623 
<212> DNA 

<213> Streptococcus pneumoniae 



cgaattgatc tgatagccaa tagaaaaccg 60, 
gatcgtttga agagaaatca gtttatactc 120 
attggcgggg atggtatgct cttgtcggpc ; 180 
gtccgcttta tcggtcttca tactggacat 240 
gagttggaca agctagtgac taatttgcag 300 
gttctgaatg tgaaggtctt tcttgaaaat 360 
gaagccagca tccgcaggtc tgatcgaacc 420 
ccctttgaac gttttcgtgg agacgggcta 480 
tataacaagt ctcttggcgg tgctgtttta, 540 
gaaattgcca gccttaataa ' tcgtgtctat 600 
aagaaggata agattgaact tattccaaca 660 
aatagcgttt attctttccg taatattgag 720 
attcactttg tcgcgactcc tagccatacc 780 
atcggtgagg tggatgaatg a ! '' 831 



<400> 100 

25 atgtcaaaag aaattaaatt tt'catcagat 
attcttgcag' acactgttaa agtaaccttg 
aagtcattcg gttcaccctt gattaccaat 
ttggaagacc attttgaaaa tatgggtgct 
aatgatatcg caggtgacgg aactacgact 

30 gaaggaatca aaaacgtcac agcaggtgca 
acagcagttg ccgcagcagt tgaagctttg 
gaagctatcg ctcaagttgc agcagtatct 
tctgaagcaa tggaaaaagt tggcaaagac 
atggaaacag agcttgaagt cgtagaagga 

35 tacatggtga cagatagcga aaaaatggtg 
acagacaaga aaatttccaa catccaagaa 
agcaatcgtc cactcttgat tattgcggat 
gttttgaaca agattcgtgg aaccttcaac 
gaccgtcgca aagccatgct tgaagatatc 

40 gaagaccttg gtcttgagtt gaaagatgcg 
gtgaccgtgg acaaagatag cacggttatt 
tctcaccgtg ttgcggttat caagtctcaa 
gaaaaattgc aagaacgctt ggccaaattg 
gccgcaactg aaactgagtt gaaagaaatg 

45 actcgtgcag ctgttgaaga aggtattgtt 
attccagctg ttgctacctt ggaattgaca 
ctccgtgctt tggaagaacc cgttcgtcaa 
atcgttatcg atcgtttgaa aaatgctgag 
gagtgggtta acatgattga tcaaggtatc 

50 ctacaaaatg cagcatctgt agccagcttg 
aaaccagaac cagtagcccc agctccagca 
taa 

<210> 101 
55 <211> 1446 
<212> DNA 

<213> Streptococcus pneumoniae 



gcccgttcag ccatggttcg tggtgtcgat ,60 
ggaccaaaag gtcgcaatgt cgttcttgaa 120 
gacggtgtga ccattgccaa agaaatcgaa 180 
aagttagtat cagaagtagc ttctaaaacc 240 
gcaacagtct tgacccaagc tatcgtccgt 300 
aatccaatcg gtattcgtcg tgggattgaa 360 
aaaaacaacg ccatccctgt tgccaataaa 420 
tctcgttctg aaaaagttgg tgagtacatc 480 
ggtgtcatca ccatcgaaga gtcacgtggt 540 
atgcagtttg accgtggtt.a cctttcacag 600 
gctgaccttg aaaatccgta cattttgatt 660 
atcttgccac ttttggaaag cattctccaa 720 
gatgtggatg gtgaggctct tccaactctt 780 
gtagtagcag tcaaggcacc tggttttggt 840 
gccatcttaa caggcggaac agttatcaca 900 
acaattgaag ctcttggtca agcagcgaga 960 
gtagaaggtg caggaaatcc tgaagcgatt 1020 
atcgaaacta caacttctga atttgaccgt 1080 
tcaggtggtg tagcggttat taaggtcgga 1140 
aaactccgca ttgaagatgc cctcaacgct 1200 
gcaggtggtg gaacagctct tgccaatgtg 1260 
ggagatgaag caacaggacg taatattgtt 1320 
attgctcaca atgcaggatt tgaaggatct 1380 
cttggtatag gattcaacgc agcaactggc 14 40 
attgatccag ttaaagtgag tcgttcagcc 1500 
attttgacaa cagaagcagt cgtagccaat 1560 
atggatccaa gtatgatggg cgggatgatg 1620 

1623 
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<400> 101 

atgattaaga ttgaaaccgt attagatatt ttaaagaaag atggcctttt tcgcgaaatt 60 

attgaccaag gtcattacca ctacaactac aigc«aaagtta tttttgatag catcagctac 120 

5 gacagccgaa aagtaacaga agacactctt ttttttgcaa aaggcgctgc ctttaaaaafc. 180 

gaataccttc tttctgctat aacacaaggt ttagcttggt atgtagctga aaaggactac 240 

gaagtcgata tccctgtcat cattgtgaac gatataaaga aagccatgag tttgattgcc 300 

atggagttct atggtaatcc acaagagaaa ctcaaactcc ttgcctttac tggtactaag 360 

ggtaagacaa cagcaaccta tttcgcctat aacatcttat ctcaggggca tagacctgct 420 

10 atgttgtcga ccatgaacac aactcttgat ggcgagactt tctttaagtc agcgttgaca 4 80 

acccctgaga gtattgacct ctttgacatg atigaatcagg qtgtgctaaa tgaccgtacc 540 

cacctcatca tggaagtctc cagtcaagcc tatctagtcc atcgagtcta tiggactgacc 600 

tttgatgtag gagtctttct taacatcact cctgaccata tcggcccgat tgaacaccct 660 

agctttgaagi actatttcta ccacaagcgt ctcttgatgg aaaatagccg agcagtcatc 720 

15 attaacagtg acatggacca cttctcagtc ttgaaagaac aggttgaaga tcaagaccat 780 

gatttctatg gtagccaatt. tgataaccaa atcgagaatt ccaaagcctt tagcttttca 840 

gctacgggta aactcgctgg agattatgat atccaactca' ttggcaactt caaccaagaa 900 

aatgcagttg ctgctggact tgcttgtctc cgtctcggag caagtcttga ggacatcaaa 960 

aaaggcatcg ctgcaacccg cgttcctggt cgtatggaag tcctcactca gaaaaatgga 1020 

20 gcfcaaggtct tcatcgacta tgcccacaat ggggatagtc tgaaaaiaact catcaatgtg 1080 

gttgaaactc atcaaaccgg aaagattgct ctggttctgg gatcaacagg aaacaaggga 1140 

gaaagtcgtc gtaaggactt tggcctcctc ctcaatcaac accctgagat tcaagtcttt 1200 

ctgactgctg atgaccctaa ctatgaagac ccaatggcca ttgcagatga aattagtagc 1260 

tacatcaatc atcctgttga aaagattgcg gatcgccaag aagccatcaa ggcggcaatg 1320 

25 gctatcacaa atcacgaatt agatgcagtt attattgcgg gtaagggagc cgattgttac 1380 

caaatcatcc agggcaagaa agaatcctac ccaggagata cagccgtcgc agaaaattat 14 4 0 

ttataa 1446 

<210> 102 
30 <211> 1980 
<212> DNA 

<213> Streptococcus pneiomoniae 
<400> 102 

35 atgatccaaa tcggcaagat ttttgccgga cgctatcgga ttgtcaaaca gattggtcga 60 
ggaggtatgg cggatgtcta cctagccaaa gacttaatct tagatgggga agaagtggca 120 
gtgaaggttc tgaggaccaa ctaccagacg gacccgatag ctgtagctcg ttttcagcgt 180 
gaagcgagag ctatggcaga tctagaccat cctcatatcg ttcggataac agatattggc 240 
gaggaagacg gtcaacagta cctagctatg gagtatgtgg ctggactgga cctcaaacgc 300 

40 tatatcaagg aacattatcc tctttctaat gaagaagcag tccgtatcat gggacaaatt 360 
ctcttggcta tgcgcttggc ccatactcga ggaattgttc acagggactt gaaacctcaa 420 
aatatcctct tgacaccaga tgggactgcc aaggtcacag actttgggat tgctgtagcc 480 
tttgcagaga caagtctgac ccagactaac tcgatgttgg gctcagttca ttacttgtca 54 0 
ccagagcagg cgcgtggttc gaaggcgact gtgcagagtg atatctatgc catggggatt 600 

45 attttctatg agatgctgac aggccatatc ccttatgacg gggatagcgc ggtgaccatt 660 
gccctccagc atttccagaa acccctgccg tccgttattg cagaaaatcc atctgtacct 720 
caggctttag aaaatgttat tatcaaggca actgctaaaa agttgaccaa tcgctaccgc 780 
tcggtttcag agatgtatgt ggacttgtct agtagcttgt cctacaatcg tagaaatgaa 840 
agtaagttaa tctttgatga aacgagcaag gcagatacca agaccttgcc gaaggtttct 900 

50 cagagtacct tgacatctat tcctaaggtt caagcgcaaa cagaacacaa atcaatcaaa 960 
aacccaagcc aggctgtgac agaggaaact taccaaccac aagcaccgaa aaaacataga 1020 
tttaagatgc gttacctgat tttgttggcc agccttgtat tggtggcagc ttctcttatt 1080 
tggatactat ccagaactcc tgcaaccatt gccattccag atgtggcagg tcagacagtt 1140 
gcagaggcca aggcaacgct caaaaaagcc aattttgaga ttggtgagga gaagacagag 1200 

55 gctagtgaaa aggtggaaga agggcggatt atccgtacag atcctggcgc tggaactggt 1260 
cgaaaagaag gaacgaaaat caatttggtt gtctcatcag gcaagcaatc tttccaaatt 1320 
agtaattatg tcggtcggaa atcctctgat gtcattgcgg aattaaaaga gaaaaaagtt 1380 
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ccagataatt tgattaaaat tgaggaagaa gagtcgaatg agagtgaggc tggaacggtc 1440 

ctgaagcaaa gtctaccaga aggtacgacc tatgacttga gcaaggcaac tcaaattgtt 1500 

ttgacagtag ctaaaaaagc tacgacgatt caattaggga actatattgg acggaactct 1560 

acagaagtaa tctcagaact caagcagaag aaggttcctg agaatttgat taagatagag 16£0 

5 gaagaagagt ccagcgaaag cgaaccagga acgattatga aacaaagtcc aggtgccgga 1680 

acgacttatg atgtgagtaa acctactcaa attgtcttga' cagtagctaa aaaagttapa- 1740 

agtgttgcca tgccgagtta cattggttct agcttggagt ttactaagaa caatttgatt 1800 

caaattgttg ggattaagga agctaatata gaagttgtag aagtgacgac agcgcctgca 1860 

ggtagtgcag a.aggcatggt tgttgaadaa agtcctagag caggtgaaaa ggtagacctc 1920 

10 aataagacta gagtcaagat ttcaatctac aaacctaaaa 1 caacttcagc tactccttaa 1980 

<210> 103 
<211> 1176 
<212> DNA 
15 <213> < Streptococcus pneumoniaej 

<400> 103 I i 

atgaaacatt ttgatactat tgtcatcggt gggggacctg ctggtatgat ggctacgatt 60 
tccagtaact tttatggaca gaaaaccctc ctcatcgaaa aaaatcggaa acttggaaaa 120 

20 aaattagctg ggactggtgg gggacgttgc aatgtgacca acaatggtag cttagacaac 180 
ctgctagctg gaattcctgg aaacggacgc tttctttaca gtgttttctc ccagttcgat 240 
aatcatgaca tcatcaactt ttttacagaa aatggtgtta aacttaaggt cgaagaccac 300 
ggacgcgtct ttccagccag tgacaagtct cggactatta tcgaagcttt ggaaaagaaa 360 
atcactgaac taggtggtca agttgctact ( caaatagaaa tcgtttctgt taaaaaagta 420 

25 gatgaccagt ttgtccttaa gtcagcgg^t caaaccttca cttgtgagaa actcattgtc ,4 80 
acaacaggtg gtaagtctta tccttcgact ggttcgactg gttttggtca cgagattgct 540 
cgccatttta agcataccat caccgatctt gaggctgctg aaagtccttt attaacagat 600 
tttccacata aagccttaca agggatttct ctggacgatg tgaccctaag ttatggtaag 660 
catgtcatca ctcatgattt actctttacc cactttggtt tgtcaggtcc tgctgcccta 720 

30 cgcatgtcta gctttgtcaa aggtggggag gttctctcac tcgatgtttt gcctcaactt 780 
tctgagaagg acttggttac atttctagaa gaaaatcggg aaaaatcctt gaaaaacgct 840 
ttaaaaacct tgttaccaga acgcttggcc gaattttttg tacaaggata tcctgaaaaa 900 
gtcaaacagc tgactgaaaa ggaacgagaa caacttgtcc agtccattaa agaacttaaa 960 
attcctgtaa ctggaaaaat gtcccttgca aagtcctttg ttaccaaggg tggagtcagt 1020 

35 ctcaaggaaa tcaatcctaa aacccttgaa agtaagctgg tacctggcct ccactttgca 1080 
ggcgaagtta tggatatcaa tgcccacacg ggtggcttta acatcacttc tgccctctgt 114 0 
accggctggg tggcgggaag tctgcattat gattaa 117 6 

<210> 104 
40 <211> 696 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 104 

45 atgctgaaat gggaagactt gcctgtggaa atgaaatcaa gcgaggttga gtcttactac 60 
cagcttgtct ctaaaaggaa gggttcgctg attttoaagc gttgcttgga ctgggttttg 120 
gccttggtgc ttacatgggt tctaacttct cccatctttc tcatcttgag catttggatc 180 
aagttggata gcaaggggcc agtgatttac aagcaagagc gtgtgaccca gtacaaccgt 24 0 
cggttcaaga tttggaagtt ccgtaccatg gtgacggatg cggataaaaa aggaagtctg 300 

50 gtgacttctg ctaacgatag ccgtattacc aaggttggaa atttcatccg acgtgtccgt 360 
ttggacgaac tgcctcagtt ggtcaatgtc cttaaaggtg agatgtcctt tgtcggtaca 420 
cgacctgaag tgccacgtta tacagagcag tatagccctg aaatgatggc aaccttgctc 4 80 
ttgcaagcag gaattacctc tccagccagc atcaactaca aggatgagga caccatcatc 540 
agtcaaatga cggagaaagg tctgtcagtt gatcaggcct atgtggagca tgttcttcct 600 

55 gaaaagatgc gctataacct cgcctatctc cgagagttta gtttctttgg ggacatcaaa 660 
atcatgtttc aaaccgtgtt tgaggtacta aaataa 696 
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<210> 105 

<211> 423 i . , 

<212> DNA ' 

<213> Streptococcus pneumoniae ',,/>, ., ,. ' 

<400> 105 

atgactagtc caetattaga atctagacgc caactccgta aatgcgcttt tcaag'ctctc 60 
atgagccttg agttcggiac ggatgtcgaa actgcttgtc gtttcgccta tactcatgat 120 
cgtgaatata cggatgtaca acttccagcc tttttgatag acctcgtttc tggtgttcaa 180 
gctaaaaagg aagaactaga taagcaaatc actcagcatt taaaagcagg ttggaccatt 240 
gaacgcttaa cgctcgtgga gagaaacctc cttcgcttgg gagtctttga aatcacttca 300 
tttgacactc ctcagctggt tgctgttaat gaagctatcg agcttgcaaa ggacttctcc 360 
gatcaaaaat ctgcccgttt tatcaatgga ctgctcagcc agtttgtaac agaagaacaa 420 
taa ■ , , 423 

<210> 106 
<211> 3540 
<212> DNA 

<213> Streptococcus pneumoniae 1 



<400> 106 

atgtatttaa aggaaatcga aattcagggg ttcaagtctt ttgctgataa gaccaaggtc 60 
gtttttgacc aaggtgtgac ggcagttgtt ggacccaatg gatctggaaa gtccaatatt 120 
acagaaagtc tgcgttgggc tttgggggag tctagtgtca agagtctccg tgggggcaag 180' 
25 atgccggatg tcatctttgc tggaacagaa agtcgcaaac cgctcaatta tgcttctgta 240 
gttgtgactc tggatdatca tgacggattt atcaaggatg caggtcaaga aatcagggtg 300 
gaacgccata tctatcgtag tggagatagc gaatacaaga ttgacggcaa gaaagtccgt 360 
ctgcgtgata-ttcatgacct cttcttggat actggattgg gacgagattc cttctctatt 4 20 
atttcccaag ggaaggttga ggagattttt aattccaagc ctgaggaacg acgagctatt 4 80 
tttgaagaag ctgctggagt tttaaaatac aagactcgca gaaaagaaac cgagagtaaa 540 
ctgcagca^a ctc^ggataa tctggaccgc ttagaggaca ttatctacga gttggataat 600 
caaatcaagc ctcttgagaa gcaagctgag aatgcccgta agtttttaga cttggaagga 660 
caacgtaagg ctatttattt agacgttctg gttgctcaaa tcaaggaaaa taaggcagaa 720 
ctagagtcga cagaagaaga gttggctcag gttcaagaac tcttgatgag ttattaccaa 780 
35 aagcgtgaaa aattagaaga agaaaatcaa actcttaaaa agcaacgcca agatttacag 840 
gctgaaatgg ccaaagacca aggcagtttg atggacttga ctagtctgat tagtgattta 900 
gaaagaaaat tagccctatc gaaactggag tccgagcaag tggccctgaa tcaacaggag 960 
gcacaagctc gtttggctgc tttggaggat aagagaaatt cactcagcaa agaaaagtat 1020 
gataaagaaa gctctttagc tctgttagag ggaaatctag tccaaaataa tcaaaaactc 1080 
aatcgtttag aagctgaatt gctggctttc tcagacgatc ctgatcagat gattgagctc 1140 
ttacgtgaac gctttgtagc tcttttacaa gaagaagcgg atgtctcaaa ccagttgacc 1200 
cgtattgaga atgagttgga aaatagtcgt cagctttctc aaaaacaagc agatcaacta 1260 
gaaaagctga aagagcaatt agctacagct aaagagaagg ctagtcagca aaaagacgag 1320 
cttgaaactg ccaaggtgca ggttcagaaa ttattggctg actatcaagc tattgccaag 1380 
gagcaagagg agcagaaaac ttccta'tcaa gctcaacaaa gtcaactctt tgaccgtctg 1440 
gatagtctca aaaacaagca ggccagagct * caaagtttgg aaaatatcct gagaaatcat 1500 
agtaactttt atgcaggtgt taagagtgtt ctccaagaaa aagatcgcct aggtgggatt 1560 
attggtgcag tcagtgagca tctgaccttt gatgtttatt atcaaactgc cctagagatt 1620 
gccttagggg caagtagcca gcatatcatc gtagaagatg aagagtcggc aaccaaagct 1680 
attgatttcc tcaaacgaaa cagagtcggt cgtgcaacct ttcttccttt gaccactatt 1740 
aaggcgcgta cgatttctag tcagaaccaa gatgctatcg ctgtaagccc aggtttcctt 1800 
gggatggcag atgagttggt gacttttgat actagactgg aagccatttt caagaacttg 1860 
ctagctacga cggctatctt tgataccgta gaacatgcgc gtgaagctgc tcgacaagtt 1920 
cgttatcagg ttcgtatggt gacattggat gggacagaat tacgcacggg tggttcctat 1980 
55 gcgggtggtg ccaatcgcca aaataacagt attttcatca agccagaact ggagcaatta 2040 
caaaaagaaa ttgctgcaga tgaagcaagc ttgggttcag aagaagcggc tttgaagacc 2100 
ttgcaagacc agatggctgc attgacagaa agattagaag ccatcaaatc tcaaggagag 2160 
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caggcacgta ttcaggagca aggcttgtcc ctcgcttatc agcaaactag tcagcaagtt 2220 
gaagaactgg aaactctttg gaaactccaa gaagaggaaa tagatcgtct tt'ctgaggga 2280 
gattggcaag cggataagga aaaatgtcaa gagagccttg ctactatcgc cagtgacaag 2340 
caaaatctgg aagctgag^t tgaagaaatt aagtctaata aaa^cgccat ccaagaacgc 2400 
5 tatcaaaatt tgcaggaaga ggtggcgcaa gctcgcttgc tt;aagacaaa actgcaaggg 24 60 
caaaaacgtt atgaagtagc tgatattgag cgtttaggca. aggaattgga caat'cfctaat 2520 
atcgaacaag aagaaattca gcgcatgctc caagaaaaag ttgacaatct tgagaaggtt 2580 
gatacagaat. tgctcagtca acaggcggaa gaatccaaaa ctcagaaaac aaatctccaa 2640 
caaggtttga ttcgcaagca gtttgagttg gatgatatag aaggtcaact ggatgatatt 2700 

10 gccagtcact tggatcaagc tcgccagcag aatgaggagt ggattcgcaa gcaaacacgt 27 60 
gctgaagcca agaaagaaaa ggtcagcgag cgcttgcgcc , atctacaaaa tpaattaaca 2820 
gaccagtacc agattagcta tactgaagca ctagaaaagg , cacatgaatt ggaaaacctc 2880 
aatctggcag agcaagaggt gcaggattta, gagaaggcta ttcgctcatt gggacctgtc 2940 
aacttggaag' ctattgacca gtacgaagaa gttcacaacc gtctggactt tctaaatagt 3000 

15 cagcgagatg atattt,tgtc agcgaaaaat ctgctccttg* aaaccattac agagatgaat 3060 
gatgaggtca aggaacgctt taaatcaacc tttgaagcta ( ttcgtgagtc ctttaaagtg 3120 
acctt.caagc agatgtttgg cggaggtcag gcagacttga tattgactga gggcgacctt 3180 
ttaacagctg gtgtggagat ttctgttcaa cctccaggta agaaaatcca gtcgcttaac 3240 
ctcatgagtg gtggtgaaaa agctctatcg gctcttgcct tgcttttctc cattattcgt 3300 

20 gtcaagacca ttccttttgt catcttggat gaggtggaag ctgcgctgga tgaagccaat 3360 
gttaaacgtt ttggggatta cctcaaccgc tttgacaagg acagccagtt tatcgtcgta 3420 
acccaccgta 1 agggaaccat ggcagcggct gattccatct atggagtgac catgcaagaa 34 80 
tcaggtgtct caaaaattgt ttcggttaag ttaaaagatt tagaaagtat tgaaggatga 354 0 

25 <210> 107 
<211> 1344 

<212> DNA - 

<213> Streptococcus pneumoniae 

30 <400> 107 

atgacaaaac gtgtaacgat tattgacgta aaagactatg ttggtcagga agtgacgatt 60 
ggcgcttggg ttgccaacaa atcaggaaaa ggaaaaatcg ctttcttaca attgcgtgat 120 
ggaacagcct tctttcaagg tgtggctttt aaaccaaact ttgtcgaaaa atttggtgaa 180 
gaagtgggac ttgagaagtt tgatgttatc aaacgcttga gccaagaaac gtctgtttat 240 

35 gtgacaggta ttgtcaaaga ggacgaacgt tctaaatttg gctatgagtt ggacatcaca 300 
gacatcgaag tgatcggtga atctcaagac tacccaatca caccaaaaga acacggaaca 360 
gactttttga tggataaccg tcacttgtgg ctacgctctc gtaagcaagt agctgtgttg 420 
caaatccgta acgctattat ctatgcaact tatgagttct ttgacaagaa cggttttatg 480 
aagtttgaca gcccaattct ttcaggaaat gcggcagaag attctacaga actctttgaa 540 

40 actgactact tcggaacgcc agcctacttg agccaatcag gtcagcttta cctagaagca 600 
ggggctatgg ctcttggtcg tgtctt'tgac tttggtccag ttttccgtgc tgaaaaatca 660 
aaaacacgcc gtcacttgac tgagttctgg atgatggatg ctgagt'actc atacttgaca 720 
catgatgagt cgcttgactt gcaagaagct tatgtgaaag ctcttctaca aggtgttctt 780 
gaccgcgcgc ctcaagcctt ggaaaccttg gaacgtgata cagaactctt gaaacgctac 8 40 

45 attgcagagc cattcaaacg tatcacttac gatcaagcca ttgacctctt gcaagagcat 900 
gaaaatgatg aagatgctga ctacgagcat cttgagcatg gtgatgactt tgggtcacca 960 
cacgaaactt ggatttcaaa ccactttggt gtgccaacat ttgtcatgaa ctatccagca 1020 
gccatcaagg ccttctacat gaaaccagtt cctggaaatc cagagcgcgt gctttgtgca 1080 
gacttgcttg ctccagaagg ctat'ggagaa attatcggtg ggtctatgcg tgaggaagat 1140 

50 tacgatgccc ttgtcgctaa gatggatgaa cttggcatgg atcgtacaga atatgaattc 1200 
taccttgacc ttcgtaaata cggtacagtt ccacacggag gatttggtat cggtatcgaa 1260 
cgtatggtaa ccttcgcagc aggaacaaaa catatccgtg aagctattcc attcccacgt 1320 
atgttgcacc gtatcaaacc ataa 1344 

55 <210> 108 
<211> 927 
<212> DNA 
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<213> Streptococcus pneumoniae* 
<400> 108 

atgtctgaaa aattagtaga aatcaaagat ttagaaattt ccttcggtga aggaagtaag 60 

5 aagtttgtcg cggttaaaaa tgctaacttc tttatcaaca agggagaaac tttctcgctt, 12t) 

gtaggtgagt ccggtagtgg gaaaacaact attggtcgtg ctatcatcgg tctaaatgat 180 

acaagtaatg gagatatcat ttttgatggt caaaagatta atggtaagaa atcgcgtg'aa 240 

caagctgcgg aattgattcg tcgaatccag atgattttcc aagaccctgc cgcaagtttg 300 

aatgaacgtg cgactgttga ttatattatt tctgaaggtc tttacaatca ccgtttattt 360 

10 aaggatgaag aagaacgtaa agagaaagtt caaaatatta tccgtgaagt aggtcttctt 420 

gctgagcact tgactcgtta ccctcatgaa ttctcaggcg gtcaacgtca acgtatcggt 480 

attgcccgtg ccttggtcat gcaaccag^c tttgttattg cagatgagcc aatttcagcc 540 

ttggacgttt ctgtacgtgc ccaagtcttg aacttgctca aaaaattcca aaaagagctc 1 600 

ggcctgacct atctcttcat cgcccatgac ttgtcggttg ttcgctttat ttcagatcgt 660 

15 atcgcagtta tttacaaggg tgttattgta gaggttgcag aaacagaaga attgtttaac 720 

aatccaattc acccatatac tcaagccttg ctttcagcgg taccaatccc agatccaatc 780 

ttggaacgta agaaggtctt gaaggtttac gacccaagtc aacacgacta tgagactgafc 840 

aagccgtcta tggtagaaat ccgtccaggt cactatgttt gggcgaacca aaccgaattg 900 

gcacgttatc aaaaaggact aaactag ' 927 



20 



25 



<210> 109 
<211> 1275 
<212>'DNA 

<213> Streptococcus pneumoniae 



<400> 109 

atgaagataa gttggaatgg attttctaaa aaatcatacc aagagcgcct cgagctgcta 60 
aaagctcagg cgctccttag tcctgagaga caagctagtc tggagaagga tgaacagatg 120 
agtgtgactg tggcagacca gctgagtgag aatgtggtgg gaactttttc tctgccttat 180 

30 tcgctggttc cggaggtact tgtcaacggt caggaataca ccgttcccta tgtgacagaa 240 
gaaccctctg tggttgcggc ggccagctat gccagcaaaa tcatcaagcg tgcaggtggt 300 
tttactgcac aagtccatca gcgccagatg attgggcagg tagcccttta tcaaattgct 360 
aatcctaaac tagcgcaaga gaagattgcc agcaagaaag cggagctctt ggagcttgcc 420 
aatcaagcct atccttctat cgttaaacgt gggggtgggg cgcgtgatct gcatgtcgag 4 80 

35 cagataaaag gcgaaccaga ctttctcgtt gtttatattc atgtcgatac ccaggaagcc 540 
atgggtgcca atatgctcaa caccatgctg gaagccttga aaccagtctt agaagaactc 600 
agtcagggac agagtctcat gggaatcctg tccaactacg cgactgattc tctggtgact 660 
gcaagctgtc gcatcgcctt tcgctacttg agccgccaaa aggatcaagg acgagagatt 720 
gcggagaaaa ttgcgttggc tagtcagttt gcgcaggctg atccttaccg agctgctact 780 

40 cataataaag gaatttttaa tggtattgat gcgattttga ttgccactgg taatgactgg 840 
cgtgccatcg aagctggggc ccatgccttt gccagtcgag atggacgcta tcaaggtctt 900 
agctgctgga cgctggacct tgaaagagaa gaattggtcg gtgagatgac cctgcccatg 960 
cctgtagcga ctaagggtgg ctctatcggc ctcaacccac gtgtagctct cagtcatgat 1020 
ctactaggaa atccttctgc cagagaatta gcccagatta tcgtgtccat cggtcttgct 1080 

45 caaaattttg cagccctcaa agccttggta agtacgggca tccagcaagg ccacatgaaa 1140 
ctacaggcca aatccctagc tctcctagct ggggctagtg aatctgaagt tgctccccta 1200 
gtagagcgcc tcatctcaga taaaaccttt aacctagaga cagcccagcg ctatctcgaa 1260 
aatttaagat cataa 1275 

50 <2io> no 

<211> 789 
<212> DNA 

<213> Streptococcus pneumoniae 
55 <400> 110 

atgccaatta catcattaga aataaaggac aagacttttg gaactcgatt cagaggtttt 60 
gatccagaag aagtcgatga atttttagat attgtggttc gtgattacga agatcttgtg 120 
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cgtgcgaatc atgataaaaa tttgcgta'tt aagagtttag aag^gcgttt gtcttacttt 180 
gatgaaataa aagattcatt gagccagtct gtattgattg ctcaggatac agctgagaga 240 
gtgaaacagg cggcgcatga acgttcaaac aatatcattc atcaagcaga gcaagatgcg 300 
caacgcttgt tggaagaagc taaatataag gcaaacgaga ttcttcgtca agcaactgat 360 
aatgctaaga aagtcgctgt tgaaacagaa gaattgaaga acaagagccg tgtcttccac 420 
caacgtctca aatctacaat tgagagtcag ttggctattg ttgaatcttc agattgggaa 480 
gatattctcc gtccaacagc tacttatctt caaaccagtg atgaagcctt taaagaagtg 540 
gttagcgaag tacttggaga accgattcca gctccaattg aagaagaacc aattgatatg 600 
acacgtcagt tctctcaagc agaaatggca gaattacaag ctcgtattga ggtagccgat 660 
aaagaattgt ctgaatttga agctcagatt aaacaggaag tggaagctcc aactcctgta 720 
gtgagtcctc aagttgaaga agagcctcig ctcatccagt tggcccaatg tatgaagaac 780 
cagaagtag ,789 



<210> 111 
15 <211> 1728 
< ) 212> DNA 

<213> Streptococcus pneumoniae 



<400> 111 

atgtctaatg gacaactaat ttatttaatg gttgcaattg cagtcatttt agttctggct 60 
tatgtagtgg caatctttct acgtaagcga aacgagggga gattagaggc gctagaagaa 120 
agaaaagaag aactatacaa tcttccagta aatgatgaag tagaagctgt aaaaaatatg 180 
cacttgattg gacaaagtca agtggctttc cgtgaatgga atcaaaaatg ggtcgattta 240 
tctctcaact cttttgccga tattgaaaat aatctct'ttg aagcagaagg ttataaccat 3G0 
25 tcatttcgtt ttctcaaggc cagtcatcaa 'attgaccaaa ttgagagtca aattactttg 360 
attgaagaag atattgcggc aattcgcaat gctttggcag acttagagaa gcaagaatct .420 
aaaaatagtg gtcgtgttct tcatgctttg gatttatttg aggaacttca gcatagagtt 480 
gctgaaaatt cagaacagta tggtcaagcc ttggatgaaa ttgaaaaaca attagaaaat 540 
atccaatctg aattttcaca atttgtaacc ttgaattcat cgggtgaccc tgtggaagcc 600 
gcagtgattt tggataatac agaaaatcac attttggcct taagtcatat tgtggatcgt 660 
gttccagcct tggttacgac gctttctaca gaattgccag atcaattaca ggatttggaa 720 
gccggttatc gtaaactaat tgatgctaat tatcattttg ttgaaacgga tattgaagcg 780 
cgtttccact tgctttatga agcattcaag aaaaaccaag agaatattcg tcagttggaa 840 
ttggataatg ccgaatatga gaatggacag gcacaagagg aaatcaatgc cttgtatgat 900 
atttttactc gagaaattgc tgctcagaaa gtagtg'gaaa atctacttgc aactcttcca 960 
acttaccttc aacatatgaa agagaataat actttattgg ga'gaagatat tgcacgtttg 1020 
aacaagacct atttacttcc tgagacagct gcaagccatg ttcgtcgtat tcagacagaa 1080 
ttagagagtt ttgaggcagc tattgttgag gtaacttcaa atcaagaaga accaacccaa 114 0 
gcttattcag ttcttgaaga aaatcttgag gatttacaaa ctcaactaaa agatattgaa- 1200 
gatgagcaaa tttcagttag tgagcgcctg acacaaattg agaaagatga tattaatgca 1260 
cgtcaaaagg ccaatgttta tgtcaatcgt ctccatacta tcaagcgata catggaaaaa 1320 
cgcaatctgc caggtattcc acaaactttc ttgaagttat tctttacggc aagcaataat 1380 
accgaggatt taatggttga gttagaacaa aaaatgatta acattgaatc tgttacccga 1440, 
gttcttgaaa ttgcaacgaa tgatatggaa gctttagaaa cggaaactta taatattgta 1500 
caatatgcaa ctttgacaga gcaactcttg caatattcta accgctatcg ctcatttgat 1560 
gaacgcattc aagaagcatt taacgaagct ttagatattt. ttgaaaaaga atttgattat 1620 
cacgcttcat ttgataagat ttctcaagca ttggaagtgg cagagcctgg tgtaaccaat 1680 
cgctttgtta cctcatatga gaaaacacgt gaaacgattc gtttttaa 1728 

50 <210> 112 
<211> 2403 
<212> DNA 

<213> Streptococcus pneumoniae 
55 <400> 112 

atgcttatat cttataaatg gttaaaagaa ttggtggaca ttgatgtgcc atcacaagag 60 • 
ttggctgaaa aaatgtcaac tacaggaatc gaggtagagg gtgtcgaatc accagctgct 120 
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ggtctctcaa aaattgtcgt cggtgaggtc ttgtcttgcg aagatgtgcc agagactcac 180 
ctccatgttt gtcaggttaa cgttggcgaa gaagagcgtc agatcgtttg tggtgcccca 240 
aatgtgcgtg ctgggatcaa ggtcatggtg gctcttccag gagctcgtat, cgctgataac 300 
tacaaaatca aaaaaggaaa aatccgtggt ftggagtcac ttggaatgat. ctgttcactt 360 
ggtgaattgg gaatttc ( tga ctcagttgtg qctaaggaat tcgcagatgg catccaaatc 420 
ttgcctgaag atgccgtgcc aiggtgaggaa gtcttttctt -acctagactt ggatgatgaa 480 
atcatcgaac tttccatcac accaaaccgt gcagatgccc tttctatgtg tggagtggct 540 
cacgaagtgg cagccatcta tgacaaggca gtcaacttta aagaatttac tctaacagaa 600 
actaatgaag ctgcggcaga tgccctttct gtcagcattg agacagacaa ggcgccttac 660 
tatgcagctc gtatcttgga caatgtgacc atcgcaccaa gtccacaatg gttgcaaaac 720 
cttctcatga acgaaggaat ccgtcccatc aataacgtag tggacgtgac caact:acatc 780 
ctgctctatt ttggtcaacc aatgcatgcc tttgacttgg ataactttga agggactgac 84 0 
atccgtgtgc gtgaagcgcg tgctggtga^ aaattggtga ccttggacgg tgaa'gaacgt 900 
gacttggacgi tgaatgacct agtcatcact gtcgcagaca agccagtagc ccttgcaggt 960 
gtcatgggtg gtcaagcaac agaaatctct gaaaaatcta gtcgtgttgt ccttgaagct 1020 
gctgttttca atggca'aatc tatccgtaag acaagtggtc gcctgaacct tcgttctgag 1080 
tcatcttctc gctttgaaaa aggaattaat gtggcaacag ttaatgaagc ccttgatgcg 1140 
gcagctagcc tgattgcgga acttgcaggt gcgacggtgc gtaagggcat cgtttcagcg 1200 
ggtgagcttg atacttcaga tgtagaagtt tcttcaaccc ttgctgatgt taaccgtgtc 1260 
ctcggaactg agctgtctta tgctgatgta gaagacgtct tccgtc'gtct tggctttggt 1320 
ctttctggaa atgcagacag ctttacagtc agagtcccac gtcgtcgttg ggatatcaca 1380 
atcgaagctg acctctttga agaaattgct cgtatctatg gttatgaccg cttgccaact 14 40 
agtctaccaa aagacgatgg tacagcaggt gaattgacag ccacacaaaa actccgccgt 1500 
caagttcgta ctattgctga aggagcaggt ttgacagaaa tcatcaccta tactctaaca 1560 
actcctgaaa aagcagttga gtttacggct caaccaagta accttacgga actcatgtgg 1620 
ccaatgactg tggatcgttc agtcctccgt caaaatatga tttcaggtat ccttgatacc 1680 
gttgcctaca acgtggctcg taagaataaa aacttggccc tttacgagat tggaaaagtc 1740 
tttgaacaaa caggtaatcc aaaagaagaa cttccaaatg aaatcaacag ttttgccttt 1800 
gccttgacag gcttggttgc tgaaaaagat ttccaaacag cagcagttcc agttgatttc 18 60 
ttctatgcta agggaatcct tgaagcccta tttactcgtt tgggactcca agtaacctat 1920 
acagcaacat- ctgaaatcgc tagccttcat ccaggtcgta cagccgtgat ttcactcggt 1980 
gaccaagttc ttggtttcct tggccaagtg catccagtca ctgccaaggc ttacgatatt 2040 
ccagaaacgt atgtggctga gcttaacctt tcagctatcg aagctgcgct tcagccagcg 2100 
actccatttg tagaaatcac caaattcccg gbagtcagcc gtgacgttgc ccttctcctc 2160 
aaggcagaag tgactcatca agaagttgta gatgctatcc aagctgccgg cgtgaaacgt 2220 
ttgacagata tcaaactctt tgacgtcttc tcaggtgaga aattgggact tggtatgaag 2280 
tcaatggctt atagcttgac cttccaaaat ccagaagata gcttaacgga cgaagaagtc 234 0 
gcacgctata tggaaaaaat ccaagcatcg ctcgaagaaa aagtcaatgc agaagtgcgt 2400 
taa 2403 

<210> 113 
<211> 543 
<212> DNA 

<213> Streptococcus pneiomoniae 
<400> 113 

atgttagaaa acgatattaa aaaagtcctc gtttcacacg atgaaattac agaagcagct 60 
aaaaaactag gtgctcaatt aactaaagat tatgcaggaa aaaatccaat cttagttggg 120 
attttaaaag gatctattcc ttttatggct gaattggtca aacatattga tacacatatt 180 
gaaatggact tcatgatggt ttctagctac catggtggaa cagcaagtag tggtgttatc 240 
aatattaaac aagatgtgac tcaagatatc aaaggaagac atgttctatt tgtagaagat 300 
atcattgata caggtcaaac tttgaagaat ttgcgagata tgtttaaagc aagagaagca 360 
gcttctgtta aaattgcaac cttgttggat aaaccagaag gacgtgttgt agaaattgag 420 
gcagactata cctgctttac tatcccaaat gagtttgtag taggttatgg tttagactac 480 
aaagaaaatt atcgtaatct tccttatatt ggagtattga aagaggaagt gtattcaaat 540 
tag 543 
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<210> 114 

<211> 235 ' " 

<212> PRT 

<213> Streptococcus , pneumoniae ' 

5 • < 

<400> 114 n , , ' , . 

Met lie Tyr Ala Gly lie Leu Ala Gly Gly Thr Gly Thr Arg Met Gly 
1 5 10 .15 

10 lie Ser Asn Leu Pro Lys Gin Phe Leu Glu Leu Gly Asp Arg Pro lie 

20 ' 25 ' • , 30* 



15 



30 



45 



Leu lie His Thr lie Glu Lys Phe Va} Leu Glu Pro Ser lie Glu Lys 
35 ' 40 45 

i 

lie Val Val Gly Val His Gly Asp Trp.Val Leu His Ala Glu Asp Leu 
50 55 60 



Val Asp Lys Tyr Leu Pro Leu His Lys Glu Arg lie lie lie Thr Lys 
20 65 70 75 80 

Gly Gly Alk Asp Arg Asn . Thr Ser lie Glu Asn lie lie 1 Glu Ala lie 

85 90 95 

25 Asp Ala Tyr Arg Pro Leu Thr Pro Glu Asp lie Val Val Thr His Asp 

100 105 110 



Ser Val Arg Pro Phe lie Thr .Leu Arg Met lie Gin Asp Ser lie Lys 
115 . 120 125 

Leu Ala Gin Asn His Asp Ala Val Asp Thr Val Val Glu Ala Val Asp 
130 135 140 



Thr lie Val Glu Ser Thr Asn Gly Gin Phe He Thr Gly He Pro Asn 
35 145 150 155 160 

Arg Ala His Leu Tyr Gin Gly Gin Thr Pro Gin Thr Phe Arg Cys Lys 

165 ■ 170 175 

40 Asp Phe Met Asp Leu Tyr Gly Ser Leu Ser Asp Glu Glu Lys Glu He 

180 185 190 



Leu Thr Asp Ala Cys Lys He Phe Val He Lys Gly Lys Asp Val Ala 
195 200 205 

Leu Ala Lys Gly Glu Tyr Ser Asn Leu Lys He Thr Thr Val Thr Asp 
210 215 220 



Leu Lys He Ala Lys Ser Met He Glu Lys Asp 
50 225 230 235 



<210> 115 
<211> 185 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 115 - 1 

Met Ala Asn Val lie He Glu Lys Ala Lys Glu Arg Met Thr Gin Ser 
1 5 10 15 

His Gin Ser Leu Ala Arg Glu Phe Gly Gly He Arg Ala Gly Arg Ala 

20 25 , 30 

Asn Ala Ser Leu Leu Asp Arg Val His Val Glu Tyr Tyr Gly Val Glu 
35 40 45 

Thr Pro Leu Asn Gin He Ala Ser He Thr He Pro Glu Ala Arg Val 
50 55 60 

* i 

i ■ ... 

Leu Leu Val Thr Pro Phe Asp Lys Ser Ser Leu Lys Asp lie Glu Ara 
65 : 70 j 75 8 ? 

Ala Leu Asn Ala Ser Asp Leu Gly He Thr Pro Ala Asn Asp Gly Serl 

85 90 95 

Val He Arg Leu Val He Pro Ala Leu Thr Glu Giu Thr Arg Arg Asp 

100 105 no* 

i 

Leu Ala Lys Glu Val Lys Lys Val Gly Glu Asn Ala Lys Val Ala Val 
115 120 125 

! ' 

Arg Asn He Arg Arg Asp Ala Met Asp Glu Ala Lys Lys Gin Glu Lys 
130 135 140 

Ala Gin Glu He Thr Glu Asp Glu Leu Lys Thr Leu Glu Lys Asp He 
145 150 155 160 

Gin Lys Val Thr Asp Asp Ala Val Lys His lie Asp Asp Met Thr Ala 

165 170 175 

Asn Lys Glu Lys Glu Leu Leu Glu Val 

180 185 



<210> 116 
<211> 450 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 116 

Met Gly Lys Tyr Phe Gly Thr Asp Gly Val Arg Gly Glu Ala Asn Leu 
1 5 .10 15 

Glu Leu Thr Pro Glu Leu Ala Phe Lys Leu Gly Arg Phe Gly Gly Tyr 

20 25 30 

Val Leu Ser Gin His Glu Thr Glu Ala Pro Lys Val Phe Val Gly Arg 
35 40 45 

4 

Asp Thr Arg He Ser Gly Glu Met Leu Glu Ser Ala Leu Val Ala Gly 
50 55 60 

Leu Leu Ser Val Gly He His Val Tyr Lys Leu Gly Val Leu Ala Thr 
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65 70 75 SO 

Pro Ala Val Ala Tyr Leu Val Glu Thr Glu Gly Ala Ser Ala Gly Val 

1 65, 90 * 95 

5 • . ■ ■ , ( 

Met lie Ser Ala Ser His Asn Pro Ala Leu Asp Asn Gly lie Lys ,Phe 

100 , 105 ♦ 110 

Phe Gly Gly Asp Gly Phe Lys Leu Asp Asp Glu Lys, Glu Ala Glu lie 
10 115 ' 120 ■ 125 

Glu Ala Leu Leu Asp Ala Glu Glu Asp Thr Leu Pro Arg Pro Ser Ala 

130 . 135 ■•. , , 140 



15 



20 



35 



50 



Glu Gly Leu Gly lie Leu Val Asp Tyr Pro Glu Gly Leu Arg Lys Tyr 
145 . ' 150 155. 160 

Glu Gly Tyr Arg Val Ser Thr Gly Thr Pro Leu Asp Gly Met Lys Val 

165 170 175 

Ala Leu Asp Thr Ala Asn Gly Ala Ala Ser Thr Ser Ala Arg Gin lie 

1 180 185 ' 190 



Phe Ala Asp Leu Gly Ala Gin Leu Thr Vai lie Giy Glu Thr Pro Asp 
25 195 200 205 

Gly Leu Asn lie Asn Leu Asn Val Gly Ser Thr His Pro Glu Ala Leu 
210 215 220 

30 Gin Glu Val Val Lys Glu Ser Gly Ser Ala lie' Gly Leu Ala Phe Asp 
225 230 ,235 240 



Gly Asp Ser Asp Arg Leu lie Ala Val Asp Glu Asn Gly Asp lie Val 

245 250 255 

Asp Gly Asp Lys lie Met Tyr He He Gly Lys Tyr Leu Ser Glu Lys 

260 265 . 270 



Gly Gin Leu Ala Gin Asn Thr He Val Thr Thr Val Met Ser Asn Leu 
40 275 280 285 

Gly Phe His Lys Ala Leu Asn Arg Glu Gly He Asn Lys Ala Val Thr 
290 295 300 

45 Ala Val Gly Asp Arg Tyr Val Val Glu Glu Met Arg Lys Ser Gly Tyr 
305 310 315 320 



Asn Leu Gly Gly Glu Gin Ser Gly His Val He Leu Met Asp Tyr Asn 

325 330 335 

Thr Thr Gly Asp Gly Gin Leu Ser Ala Val Gin Leu Thr Lys He Met 

340 345 350 



Lys Glu Thr Gly Lys Ser Leu Ser Glu Leu Ala Ala Glu Val Thr He 
55 355 360 365' 

Tyr Pro Gin Lys Leu Val Asn He Arg Val Glu Asn Val Met Lys Glu 
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370 



375 



380 



Lys Ala Met Glu Val Pro Ala lie Lys Ala lie lie Glu Lys Met Glu 
385 390 395 400 

I > , 

I 

Glu Glu Met Ala Gly Asn Gly Arg lie Leu Val Arg, Pro Ser Gly Thr 

405 1 410 ' 415 - ' 



10 



Glu Pro Leu Leu Arg Val Met Ala Glu Ala Pro Thr Thr Glu Glu Val 



420 



425 



430 



Asp Tyr Tyr Val Asp Thr lie Thr Asp Val Val Arg Ala Glu lie Gly 



435 



440 



445 



15 lie Asp 
450 



<210> 117 
20 <211> 234 
<212> PRT 

<213> Streptococcus pneumoniae 

<400> 117 | 
25 Met Lys Lys lie Leu lie Val Asp Asp Glu Lys Pro lie Ser Asp lie 
1 5 10 15 



30 



lie Lys Phe Asn Met Thr Lys Glu Gly Tyr Glu Val Val Thr Ala Phe 

20 25 30 

Asn Gly Arg Glu Ala Leu Glu Gin Phe Glu Ala Glu Gin Pro Asp lie 
35 40 45 



lie lie Leu Asp Leu Met Leu Pro Glu lie Asp Gly Leu Glu Val Ala 
35 50 55 60 

Lys Thr lie. Arg Lys Thr Ser Ser Val Pro lie Leu Met Leu Ser Ala 
65 70 75 80 

40 Lys Asp Ser Glu Phe Asp Lys Val lie Gly Leu Glu Leu Gly Ala Asp 

85 90 95 



45 



Asp Tyr Val Thr Lys Pro Phe Ser Asn Arg Glu Leu Gin Ala Arg Val 

100 105 110 

Lys Ala Leu Leu Arg Arg Sef Gin Pro Met Pro Val Asp Gly Gin Glu 
115 120 125 



Ala Asp Ser Lys Pro Gin Pro lie Gin lie Gly Asp Leu Glu lie Val 
50 130 135 140 



55 



Pro. Asp Ala Tyr Val Ala Lys Lys Tyr Gly Glu Glu Leu Asp Leu Thr 
145 150 155 160 

His Arg Glu Phe Glu Leu Leu Tyr His Leu Ala Ser His Thr Gly Gin 

165 170 175 
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Val lie Thr Arg Glu His Leu Leu Glu * Thr Val Trp Gly Tyr Asp Tyr 

180 185 190 

i Phe Gly .Asp Val Arg Thr Val Asp Val Thr Val Arg Arg - Leu Arg Glu 
5 195 ^ 200 205 ' , 

Lys lie Glu Asp Thr Pro Ser Arg Pro Glu Tyr lie Leu Thr Arg Arg 
210 215 220 

10 Gly Val Gly Tyr Tyr Met Arg Asn Asn Ala 
225 230 



<210> 118 ! , 

15 <211>,368 * | 

<212> PRT 

<213> Streptococcus pneumoniae. ' 
<400> 118 

20 Met Glu Glu lie Leu Cys lie Gly Cys 'Gly Ala Thr lie Gin Thr Thr 
1 5 . >10 15 



25 



Asp Lys Ala Gly Leu Gly Phe Thr Pro Gin Ser Ala Leu Glu Lys Gly 



20 



£0 



Leu Glu Thr Gly Glu Val Tyr Cys Gin Arg Cys Phe Arg Leu Arg His 
35 40 45 



Tyr Asn Glu lie Thr Asp Val Gin Leu Thr Asn Asp Asp Phe Leu Lys 

30 50 55 60 

t * 

Leu Leu His Glu Val Gly Asp Ser Asp Ala Leu Val Val Asn Val lie 

65 70 75 80 

35 Asp lie Phe Asp Phe Asn Gly Ser Val lie Pro Gly Leu Pro Arg Phe 

85 90 95 



40 



Val Ser Gly Asn Asp Val Leu Leu Val Gly Asn Lys Lys Asp lie Leu 

100 105 110 

Pro Lys Ser Val Lys Ser Gly Lys lie Ser Gin Trp Leu Met Lys Arg 
115 120 125 



Ala His Glu Glu Gly Leu Arg Pro Val Asp Val Val Leu Thr Ser Ala 

45 130 135 140 

Gin Asn Lys His Ala lie Lys Glu Val lie Asp Lys lie Glu His Tyr 

145 150 155 160 

50 Arg Lys Gly Arg Asp Val Tyr Val Val Gly Val Thr Asn Val Gly Lys 

165 . 170 175 



55 



Ser Thr Leu lie Asn Ala lie lie Gin Glu lie Thr Gly Asp Gin Asn 

180 185 190 

* 

Val lie Thr Thr Ser Arg Phe Pro Gly Thr Thr Leu* Asp Lys lie Glu 
195 200 205 
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10 



25 



35 



40 



50 



lie Pro Leu Asp Asp Gly Ser Tyr lie Tyr Asp Thr Pro Gly lie lie 

210 215 220 . 

His Arg His Gin Met' Ala His Tyr Leu Thr Ala Lys Asn Leu Lys Tyr 

225 230 235 . 240 

Val Ser Pro Lys Lys Glu lie Lys Pro Lys Thr Tyr Gin Leu Asn Pro 

245 250 255 

r i # 

Glu Gin Thr Leu Phe 1 Leu Gly Gly Leu' Gly Arg Phe Asp Phe, lie Ala 

260 265 270 ' 



■ i 

Gly Glu Lys Gin Gly Phe Thr Ala Phe Phe Asp Asn Glu Leu Lys Leu 
15 275 280 285 

His Arg Ser Lys Leu Glu Gly Ala Ser Ala Phe Tyr Asp Lys His Leu 
290 295 300 

20 Gly Thr Leu Leu Thr Pro Pro Asn Ser Lys Glu Lys Glu Asp Phe Pro 
305 310 315 320 



Arg Leu Val Gin His Val Phe Thr lie Lys Asp Lys Thr Asp Leu Val 

325 330 335 

lie Ser Gly Leu Gly Trp He Arg Val Thr Gly Thr Ala Lys Val Ala 

340 345 350 



Val Trp Ala Pro Glu Gly Val Ala Val Val Thr Arg Lys Ala He He 
30 ' 355 360 365 



<210> 119 
<211> 486 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 119 

Met Tyr Pro Asp Asp Ser Leu Thr Leu His Thr Asp Leu .Tyr Gin He 
15 10 15 



45 Asn Met Met Gin Val Tyr Phe Asp Gin Gly He His Asn Lys Lys Ala 

20 25 30 



Val Phe Glu Val Tyr Phe Arg Gin Gin Pro Phe Lys Asn Gly Tyr Ala 

, 35 ' 40 45 

Val Phe Ala Gly Leu Glu Arg He Val Asn Tyr Leu Glu Asp Leu Arg 
50 55 60 



Phe Ser Asp Ser Asp He Ala Tyr Leu Glu Ser Leu Gly Tyr His Gly 
55 65 70 75 80 

Ala Phe Leu Asp Tyr Leu Arg Asn Phe Lys Leu Glu Leu Thr Val Arg 
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85 90 95 : 

Ser Ala Gin Glu Gly Asp Leu Val Phe Ala Asn Glu Pro lie Val Gin 

100* 1 , 105 ■ ' ' ' 110 ' 

' ' i , 

Val Glu Gly Pro Leu Ala Gin Cys G^n Leu Val Glu Thr Ala Lett ( Leu 
115 120 125 

Asn lie Val Asn Tyr Gin Thr Leu Val Ala Thr Lys, Ala Ala Arg lie 
130 ■ 135 140 

i' i 

Arg Ser Val lie Glu Asp Glu Pro Leu Met Glu Phe Gly Thr Arg Arg 

145 150 1 , ■ i 155 160 

Ala Gin Glu Thr Asp Ala Ala lie Trp Gly Thr Arg Ala Ala Val lie 

165 . 170 175 

Gly Gly Ala Asn Gly Thr Ser Asn Val Arg Ala Gly Lys Leu Phe Asp 

180 185 ' 190 

i 

lie Pro Val Leu Gly Thr His Ala His Ala Leu Val Gin Val Tyr Gly 
195 200 205 

Asn Asp Tyr Glu Ala Phe Lys Ala Tyr Ala- Ala Thr His Lys Asn Cys 
210 215 220 

Val Phe Leu Val Asp Thr Tyr Asp Thr Leu Arg lie Gly Val Pro Ala 
225 230 235 240 

Ala lie Gin Val Ala Arg Glu Leu Gly Asp Gin lie Asn Phe Met Gly 

245 250 255 

Val Arg lie Asp Ser Gly Asp lie Ala Tyr lie Ser Lys Lys Val Arg 

260 265 270 

Gin Gin Leu Asp Glu Ala Gly Phe Thr Glu Ala Lys lie Tyr Ala Ser 
275 280 285 

Asn Asp Leu Asp Glu Asn Thr lie Leu Asn Leu Lys Met Gin Lys Ala 
290 295 300 

4 

Lys lie Asp Val Trp Gly Val Gly Thr Gin Leu lie Thr Ala Tyr Asp 
305 310 315 320 

Gin Pro Ala Leu Gly Ala Val Tyr Lys lie Val Ala lie Glu Asp Glu 

325 330 335 

Thr Gly Gin Met Arg Asn Thr lie Lys Leu Ser Asn Asn Ala Glu Lys 

340 345 350 

Val Ser Thr Pro Gly Lys Lys Gin Val Trp Arg lie Thr Ser Arg Glu 
355 360 365 

Lys* Gly Lys Ser Glu Gly Asp Tyr lie Thr Tyr Asp Gly Val Asp lie 
370 375 380 

Ser Asp Met Thr Glu He Lys Met Phe His Pro Thr Tyr Thr Tyr He 
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385 



390 



395 



400 



Lys Lys Thr Val Arg Asn Phe Asp Ala Val Pro Leu Leu Val Asp lie 

405 410 415 



Phe Lys Glu Gly lie Leu Val Tyr Asn Leu Prq Ser L$u Thr Asp Il£ 

420 425 ' i 430 

Gin Asp Tyr Ala Arg Lys Glu Phe Asp Lys Leu Trp Asp Glu Tyr Lys 
10 435 440 445 



15 



20 



Arg Val Leu Asn Pro Gin His Tyr Pro Val Asp Leu Ala Arg Asp Val 



450 



455 



4 60 



Trp Gin Asp Lys Met Asp Leu lie Asp Lys Met Arg Lys Glu Ala Leu 
465 470 ! 475 ' 480 



Gly Glu Gly Glu Glu Glu 

485 



25 



30 



<210> 120 
<211> 283 

<212> PRT | 
<213> Streptococcus pneumoniae! i 

<400> 120 

Met Ala Thr lie Gin Trp Phe Pro Gly His Met Ser Lys Ala Arg Arg 
1 .5 10 15 

Gin Val Gin Glu Asn Leu Lys Phe Val Asp Phe Val Thr lie Leu Val 

20 25 30 



Asp Ala Arg Leu Pro Leu Ser Ser Gin Asn Pro Met Leu Thr Lys lie 
35 .35 40 45 

Val Gly Asp Lys Pro Lys Leu Leu lie Leu Asn Lys Ala Asp Leu Ala 
50 55 60 

40 Asp Pro Ala Met Thr Lys Glu Trp Arg Gin Tyr Phe Glu Ser Gin Gly 
65 70 75 80 



45 



lie Gin Thr Leu Ala lie Asn Ser Lys Glu Gin Val Thr Val Lys Val 

85 90 95 

Val Thr Asp Ala Ala Lys Lys Leu Met Ala Asp Lys lie Ala Arg Gin 

100 105 110 



Lys Glu Arg Gly lie Gin lie Glu Thr Leu Arg Thr Met lie lie Gly 
50 115 120 125 



lie Pro Asn Ala Gly Lys Ser Thr Leu Met Asn Arg Leu Ala Gly Lys 
130 135 140 

55 Lys lie Ala Val Val Gly Asn Lys Pro Gly Val Thr Lys Gly Gin Gin 
145 150 155 160 
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10 



15 



Trp Leu Lys Thr Asn Lys Asp Leu Glu lie Leu Asp Thr Pro Gly lie 

165 170 175 

Leu Trp Pro Lys Phe Glu Asp Glu Thr Vai Ala Leu Lys Leu Ala Leu 

180 185 190 

! 

I i 

Thr Gly Ala lie Lys Asp Gin Leu Leu Pro Met Asp Glu Val Thr lie 
195 200 205 

Phe Gly lie Asn Tyr Phe Lys Glu His Tyr Pro Glu Lys Leu Ala Qlu 
210 215 220 

i i 

Arg Phe Lys Gin Met Lys lie Glu Glu Glu Pro Ser Val lie lie Met 



225 



230 



235 



240 



Asp Met Thr Arg Ala Leu Gly Phe Arg Asp Asp Tyr Asp Arg Phe Tyr 

245 250 255 1 



Ser Leu Phe Val Lys Glu Val Arg Asp Gly Lys Leu Gly Asn Tyr Thr 
20 260 2 65 270 

Leu Asp Thr Leu Glu Asp Leu Asp Gly Asn Asp 
275 * 280 



25 



30 



35 



<210> 121 
<211> 156 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 121 

Met lie Asn Asn Val. Val Leu Val Gly Arg Met Thr Arg Asp Ala Glu 
1*5, 10 15 

Leu Arg Tyr Thr Pro Ser Asn Val Ala Val Ala Thr Phe Thr Leu Ala 

20 25 30 



40 



Val Asn Arg Thr Phe Lys Ser Gin Asn Gly Glu Arg Glu Ala Asp Phe 
35 40 45 

lie Asn Val Val Met Trp Arg Gin Gin Ala Glu Asn Leu Ala Asn Trp 
50 55 60 



Ala Lys Lys Gly Ser Leu lie Gly Val Thr Gly Arg lie Gin Thr Arg 
45 65 70 75 * 80 

Ser Tyr Asp Asn Gin Gin Gly Gin Arg Val Tyr Val Thr Glu Val Val 

85 90 95 

50 Ala Glu Asn. Phe Gin Met Leu Glu. Ser Arg Ser Val Arg Glu Gly His 

100 105 110' 



55 



Thr Gly Gly Ala Tyr Ser Ala Pro Thr Ala Asn Tyr Ser Ala -Pro Thr 

115 ' 120 125 

Asn Ser Val Pro Asp Phe Ser Arg Asn Glu Asn Pro Phe Gly Ala Thr 

130 135 140 
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Asn Pro Leu Asp lie Ser ,Asp Asp Asp Leu Pro Phe 
145 150 155 

■ ■ - ■■ 

<210> 122 u 

<211> 324 , ' 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 122 ' ' , , 1 

Met Lys Thr Arg lie Thr Glu Leu Leu Lys' lie . Asp Tyr Pro lie Phe 

1 5 \ . , 10 15 

■ > * 1 

Gin Gly Gly Met A>la Trp Val Ala Asp Gly Asp< Leu Ala Gly Ala Val 

20 25. ' 30 

i 

Ser Lys Ala Gly Gly Leu Gly lie lie Gly Gly Gly Asn Ala Pro Lys 

35 40 ' 45 

i 

Glu Val Val Lys Ala Asn lie Asp Lys lie Lys Ser Leu Thr Asp Lys 
50 i 55 60 

Pro Phe Gly Val Asn lie Met Leu Leu Ser Pro Phe Val Glu Asp lie 
65 70 75 80 

Val Asp Leu Val lie Glu Glu Gly Val Lys Val Val Thr Thr Gly Ala 

85 90 95 

i 

Gly Asn Pro Ser Lys Tyr Met Glu Arg Phe His Glu Ala Gly lie lie 

100 105 110 

Val lie Pro Val Val Pro Ser Val Ala Leu Ala Lys Arg Met Glu Lys 
115 120 125 

He Gly Ala Asp Ala Val He Ala Glu Gly Met Glu Ala Gly Gly His 
130 135 140 

He Gly Lys Leu Thr Thr Met Thr Leu Val Arg Gin Val Ala Thr Ala 
145 150 155 . 160 

He Ser He Pro Val He Ala Ala Gly Gly lie' Ala Asp Gly Glu Gly 

165 170 175 

Ala Ala Ala Gly Phe Met Leu Gly Ala Glu Ala Val Gin Val Gly Thr 

180 185 190 

* 

Arg Phe Val. Val Ala Lys Glu Ser Asn Ala His Pro Asn Tyr Lys Glu 
195 200 205 

Lys He Leu Lys Ala Arg Asp He Asp Thr Thr He Ser Ala Gin His 
210 215 220 

Phe Gly His Ala Val Arg Ala He Lys Asn Gin Leu Thr Arg Asp Phe 
225 230 235 240 

Glu Leu Ala Glu Lys Asp Ala Phe Lys Gin Glu Asp Pro Asp Leu Glu 



58 



5 



30 



45 
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j 

245 ' 250 255 

> 

lie Phe Glu Gin Met Gly Ala Gly Ala Leu Ala Lys Ala Val Val His 

260 265 . , 270 ' ' 

Gly Asp Val Glu Gly Gly Ser Val Met Ala Gly Gin , lie Ala Gly Leu > 
275 280 285 



Val Ser Lys Glu Glu Thr Ala Glu- Glu lie Leu Lys Asp Leu Tyr Tyr 

10 290 295 1 300 

Gly Ala Ala Lys Lys He Gin Glu Glu Ala Ser, Arg Trp Thr Gly Val. 

305 310 315 * ■ 320 

i * * 

15 Vial Arg Asn Asp j 



<210> 123 
20 <211> 140 
<212> PRT 

<213> Streptococcus pneumoniae 

<400> 123 j 
25 Met He Asp He Gin Gly He Lys Glu Ala Leu Pro His Arg Tyr Pro 
1 5 10 15 



Met Leu Leu Val Asp Arg Val Leu Glu Val Ser Glu Asp Thr He Val 

20 25 30 

Ala He Lys Asn Val Thr He Asn Glu Pro Phe Phe Asn Gly His Phe 

35 4 0 45 



Pro Gin Tyr Pro Val Met Pro Gly Val Leu He Met Glu Ala Leu Ala 

35 50 - * 55 60 

Gin Thr Ala Gly Val Leu Glu Leu Ser Lys Pro Glu Asn Lys Gly Lys 

65 70 75 80 

* 

40 Leu Val Phe Tyr Ala Gly Met Asp Lys Val Lys Phe Lys Lys Gin Val 

85 90 95 



Val Pro Gly Asp Gin Leu Val Met Thr Ala Thr Phe Val Lys Arg Arg 

100 105 110 

Gly Thr He Ala Val Val Giu Ala Lys Ala Glu Val Asp Gly Lys Leu 
115 120 125 



Ala Ala Ser Gly Thr Leu Thr Phe Ala He Gly Asn 
50 130 135 140 



<210> 124 
<211> 340 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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40 
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<400> 124 , • . , 

Met lie Asn Gin lie Tyr Gin Leu Thr Lys Pro Lys Phe lie Asri Val 
• 1 5 1 10 , 15 

Lys Tyr Gin Glu Glu Ala lie Asp Gin Glu Asn pis lie Leu lie Arg ! 

20 25 1 30 

r 

Pro Asn Tyr Met Ala Val Cys His Ala Asp Gin Arg Tyr Tyr Gin Gly 
35 40 45 

i - - . ■ 

Lys Arg Asp Pro Lys lie 1 Leu Asn Lys' Lys Leu Hro Met Ala Met ' lie 

50 '55 1 ' ' 60 1 



His Glu Ser Cys Gly lie Val lie 'Ser Asp Pro Ser' Gly Thr Tyr Glu 

15 65 70 * 75 80 

i ' ■ 

Val Gly Gin Lys Val Val Met lie Pro Asn Gin Ser Pro Met Gin Ser 

85 90 95 

20 Asp Glu Glu Phe Tyr Glu Asn Tyr Met Thr Gly Thr His Phe Leu Ser 

100 105 ' 110 

■ i • ■ ' 

Ser Gly Phe Asp Gly Phe Met Arg Glu Phe Val Ser Leu Pro Lys Asp 
115 120 125 



Arg Va'l Val Ala Tyr Asp Ala lie Glu Asp Thr Val Ala Ala lie Thr 
130 135 140 



Glu Phe Val Ser Val Gly Met His Ala Met Asn Arg Leu Leu Thr Leu 
30 145 150 155 160 

Ala His Ser Lys Arg Glu Arg lie Pro Val lie Gly Asp Gly Ser Leu 

165 170 175 

35 Ala Phe Val Val Ala Asn lie lie Asn Tyr Thr Leu Pro Glu Ala Glu 

180 185 190 



lie Val Val lie Gly Arg His Trp Glu Lys Leu Glu Leu Phe Ser Phe 
195 200 205 

Ala Lys Glu Cys Tyr lie Thr Asp Asn lie Pro Glu Glu Leu Ala Phe 
210 215 ' 220 



Asp His Ala Phe Glu Cys Cys Gly Gly Asp Gly Thr Gly Pro Ala lie 
45 225 230 235 240 

Asn Asp Leu lie Arg Tyr lie Arg Pro Gin Gly Thr lie Leu Met Met 

245 250 255 

50 Gly Val Ser Glu Tyr Lys Val Asn Leu Asn Thr Arg Asp Ala Leu Glu 

260 265 270 



Lys Gly Leu Leu Leu Val Gly Ser Ser Arg Ser Gly Arg lie Asp Phe 
275 280 285 

Glu Asn Ala lie Gin Met Met Lys Val Lys Lys Phe Ala Asn Arg Leu 
290 295 300 
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10 



20 



35 



50 



Lys Asn lie Leu Tyr Leu' ( Glu Glu Pro Val Arg Glu lie Lys Asp lie 

305 310 315 320 

• i , \..., , , ... 

• i ' . 

His Arg Val Phe Ala Thr Asp Leu Asn Thr Ala Pl>e Lys Thr Val Phe 

■« 325 330 335 , 

i 

Lys Trp Glu Val 

340 



<210> 125 

<211> 447 

<2125> PRT 1 ' 

15 <213> Streptococqus pneumoniae 



<400> 125 

Met Asn Leu Lys Thr Thr Leu Gly Leu Leu Ala Gly Arg Ser Ser His 
1 5 10 15 

\ 

Phe Val Leu Ser Arg Leu Gly Arg Gly Ser Thr Leu' Pro Gly Lys Val 

1 20 25 '30 



A±a lieu urn fne Asp> .uys asp xj.e lieu vjj.n nsn iieu axa Lys Asn Tyr 

25 35 40 45 

Glu lie Val Val Val Thr Gly Thr Asn Gly Lys Thr Leu Thr Thr Ala 

50 55 60 

30 Leu Thr Val Gly lie Leu Lys Glu Val Tyr Gly Gin Val Leu Thr Asn 

65 70 75 80 



Pro Ser Gly Ala Asn Met lie Thr Gly lie Ala Thr Thr Phe Leu Thr 

85 90 95 

Ala Lys Ser Ser Lys Thr Gly Lys Asn lie Ala Val Leu Glu lie Asp 

100 105 110 



Glu Ala Ser Leu Ser Arg lie Cys Asp Tyr lie Gin Pro Ser Leu Phe 
40 115 120 125 

Val lie Thr Asn lie Phe Arg Asp Gin Met Asp Arg Phe Gly Glu lie 
130 135 140 

* 

45 " Tyr Thr Thr Tyr Asn Met lie Leu Asp Ala lie Arg Lys Val Pro Thr 
145 150 155 160 



Ala Thr Val Leu Leu Asn Gly Asp Ser . Pro Leu Phe Tyr Lys Pro Thr 

165 170 175 

lie Pro Asn Pro lie Glu Tyr Phe Gly Phe Asp Leu Glu Lys Gly Pro 

180 185 190 



Ala Gin Leu Ala His Tyr Asn Thr Glu Gly lie Leu Cys Pro Asp Cys 
55 195 200 . 205 

Gin Gly lie Leu Lys Tyr Glu His Asn Thr Tyr Ala Asn Leu Gly Ala 
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210 215 ! 220 

Tyr lie Cys Glu Gly Cys Gly Cys Lys Arg. Pro Asp Leu Asp Tyr Arg 
225 230 235 240 

5 ■ ■ ' - ; . : , •; - 

Leu Thr Lys Leu Val Glu Leu Thr Asn Asn Arg Ser Arg Phe Val Il£ 

245 2'50 255 1 ' 

Asp Gly Gin Glu Tyr Gly lie Gin lie Gly Gly Leu Tyr Asn lie Tyr 
10 260 265 , 270 

Asn Ala Leu Ala Ala Val Ala lie Ala Arg Phe Leu Gly Ala Asp Ser 

275 28|0 285 ' ' , 

15 Gin Leu lie Lys Gin Gly Phe Asp Lys Ser Arg Ala Val Phe Gly Arg 
290 295 ! 300 



20 



35 



45 



50 



Gin Glu Thr Phe His lie Gly Asp Lys Glu Cys Thr Leu Val Leu lie 
305 310 315 320 

Lys Asn Pro Val Gly Ala Thr Gin Ala lie Glu Met lie Lys Leu Ala 

325 330 335 



i ■ < 



Pro Tyr Pro Phe Ser Leu Ser Val Leu Leu Asn -Ala Asn Tyr Ala Asp 

25 340 ; 345 350 

Gly lie Asp Thr Ser Trp lie Trp Asp Ala Asp Phe Glu Gin lie Thr 
355 360 365 

30 Asp Met Asp lie Pro Glu lie Asn Ala Gly Gly Val Arg His Ser Glu 
370 375 380 



lie Ala Arg Arg Leu Arg Val Thr Gly Tyr Pro Ala Glu Lys lie Thr 
385 390 395 400 

Glu Thr Ser Asn Leu Glu Gin Val Leu Lys Thr He Glu Asn Gin Asp 

405 410 415 



Cys Lys His Ala Tyr He Leu Ala Thr Tyr Thr Ala Met Leu Glu Phe 
40 420 425 430 



Arg Glu Leu Leu Ala Ser Arg Gin He Val Arg Lys Glu Met Asn 
435 440 445 



<210> 126 
<211> 260 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 126 

Met Val Tyr Thr Ser Leu Ser Ser Lys Asp Gly Asn Tyr Pro Tyr Gin 
1 5 10 15 

55 Leu Asn He Ala His Leu Tyr Gly Asn .Leu Met Asn Thr Tyr Gly Asp 

20 25 30 
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i 

! 

Asn Gly Asn lie Leu Met Leu Lys Tyr Val Ala Glu Lys Leu Gly Ala 
35 40 45 ' 

, His Val Thr Val Asp lie Val Ser Leu His Asp Asp Phe Asp Glu Asn . ' 

5 50 55 60 ' , ■ 

* , ' 

His Tyr Asp lie Ala Phe Phe Gly Gly Gly Gin Asp Phe Glu Gin Ser 
65 .70 75 80 

10 He He Ala Asp Asp Leu Pro Ala Lys Lys Glii Ser Il,e Asp Asn Tyr 

85 90 95 



15 



30 



45 



55 



lie Gin Asn Asp Gly Val Val Leu Ala He Cys Gly Gly Phe Gin Leu 

100 i 105 110 

i 

. ' 1 ! 

Leu Gly Gin Tyr Tyr Val Glu Ala Ser Gly Lys Arg lie Glu Gly Leu 
115 120 125 1 



i ■ 



Gly Val Met Gly His Tyr Thr Leu Asn Gin Thr Asn Asn Arg Phe lie 
20 130 135 140 

Gly Asp lie Lys lie His Asn Glu Asp Phe Asp Glu Thr Tyr Tyr Gly 
145 150 155 160 

I , 

25 Phe Glu Asn His Gin Gly Arg Thr Phe Leu Ser Asp Asp Gin Lys Pro 

165 170 175- 



Leu Gly Gin Val Val Tyr Gly Asn Gly Asn Asn Glu Glu Lys Val Gly 

180 185 190 

Glu Gly Val His Tyr. Lys Asn Val Phe Gly Ser Tyr Phe His Gly Pro 
195 200 205 



lie Leu Ser Arg Asn Ala Asn Leu Ala Tyr Arg Leu Val Thr Thr Ala 
35 210 215 220 

Leu Lys Lys Lys Tyr Gly Gin Asp lie Gin Leu Pro Ala Tyr Glu Asp 

225 230 235 240 

40 lie Leu Ser Gin Glu lie Ala Glu Glu Tyr Ser Asp Val Lys Ser Lys 

245 . 250 255 



Ala Asp Phe Ser 

260 



<210> 127 • * 

<211> 223 
<212> PRT 
50 <213> Streptococcus pneumoniae 



<400> 127 

Met Asn Val Lys Glu Asn Thr Glu Leu Val Phe Arg Glu Val Ala Glu 
15 10 15 

Ala Ser Leu Ser Ala Asn Arg Glu Ser Gly Ser Val Ser Val fie Ala 

20 25 30 
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Val Thr Lys Tyr Val Asp. Val Pro Thr Ala Glu Ala Leu Leu Pro Leu 

35 40 ■ , ; 45 ' 



Gly Val His His He Gly Glu Asn Arg Val Asp Lys Phe Leu Glu Lys 

50 ■ 55 60 / 

Tyr Glu Ala Leu Lys Asp Arg Asp Val Thr Trp His Leu He Gly Thr 
65 70 75 80 

i ' ■ 

Leu Gin Arg Arg Lys Val Lys Asp Val He Gin ■ Tyr Val Asp' ■ Tyr Phe, 

85 90 . 95 

His Ala Leu Asp Ser Val Lys Leu Ala Gly Glu He Gin Lys Arg Ser 

100 i 105 no 

Asp Arg Val He Lys Cys Phe Leu Gin Val Asn He Ser Lys Glu Glu 
115 120 , , 125 

Ser Lys His Gly Phe Ser Arg Glu Glu Leu Leu Glu lie Leu Pro Glu 
130 135 140 

Leu Ala Gly Leu Asp Lys lie Glu Tyr Val Gly Leu Met Thr Met Ala 
145 155 160 

Pro Phe Glu Ala Ser Ser Glu Gin Leu Lys Glu lie Phe Lys Ala Ala 

• 165 170 175 

Gin Asp Leu Gin Arg Glu He Gin Glu Lys Gin He Pro Asn lie Pro 

180 .185 190 

Met Thr Glu Leu Ser Met Gly Met Ser Arg Asp Tyr Lys Glu Ala lie 
195 ' 200 205 

Gin Phe Gly Ser Thr Phe Val' Arg He Gly Thr Ser Phe Phe Lys 
210 215 220 



<210> 128 
<211> 279 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 128 

Met Gly He Ala Leu Glu Asn Val Asn Phe Thr Tyr Gin Glu Gly Thr 
15 10 15 

Pro Leu Ala Ser Ala Ala Leu Ser Asp Val Ser Leu Thr lie Glu Asp 

'20 25 30 

Gly Ser Tyr Thr Ala Leu lie Gly His Thr Gly Ser Gly Lys Ser Thr 
35 40 45 

He .Leu Gin Leu Leu Asn Gly Leu Leu Val Pro Ser Gin Gly Ser Val 
50- 55 60 

Arg Val Phe Asp Thr Leu lie Thr Ser Thr Ser Lys Asn Lys Asp He 
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■ 

65 70 75 80 

t 

Arg Gin lie Arg Lys Gin Val Gly Leu Val Phe Gin Phe Ala Glu Asn 

85. ' 90 1 95 

5 • ' 

Gin lie Phe Glu Glu Thr Val Leu Lys Asp Val' Ala Phe Gly Pro Gin 

100 , 105 ' 110 

Asn Phe Gly Val Ser Glu Glu Asp Ala Val Lys Thr , Ala Arg Glu Lys 
10 115 , ] 120 ' • 1 125 

Leu Ala Leu Val Gly lie Asp Glu Ser Leu Phe. Asp Arg Ser Pro Phe 
130 135 • t 140 , 

15* Glu Leu Ser Gly Gly Gin Met Arg Arg Val Ala' lie Ala Gly lie Leu 
145 150 155 160 



20 



30 



35 



Ala Met Glu Pro Ala lie Leu Val Leu Asp Glu Pro Thr Ala Gly Leu 

165 170 175 

* * * * 

Asp Pro. Leu Gly Arg Lys Glu Leu Met Thr Leu Phe Lys Lys Leu His 

1 180 185 ' 190 



Gin Ser Gly Met Thr lie Val Leu Val Thr His Leu Met Asp Asp Val 

25 195 200 - 205 

Ala Glu Tyr Ala Asn Gin Val Tyr Val Met Glu Lys Gly Arg Leu Val 

210 215 220 



Lys Gly Gly Lys Pro Ser Asp Val Phe Gin Asp Val Val Phe Met Glu 

225 230 235 240 

Glu Val Gin Leu Gly Val Pro Lys lie Thr Ala Phe Cys Lys Arg Leu 

245 250 255 

Ala Asp Arg Gly Val Ser Phe Lys Arg Leu Pro Val Lys lie Glu Glu 

260 265 270 



Phe Lys Glu Ser Leu Asn Gly 
40 275 



<210> 129 
<211> 309 
45 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 129 

Met Asp lie Gin Phe Leu Gly Thr Gly Ala Gly Gin Pro Ser Lys Ala 
50 1 5 10 - 15 

Arg Asn Val Ser Ser Leu Ala Leu Lys Leu Leu Asp Glu lie Asn Glu 

20 . 25 30 

55 Val Trp Leu Phe Asp Cys Gly Glu Gly Thr Gin Asn Arg lie Leu Glu 

35 40 45 
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i 

Thr Thr lie Arg Pro Arg Lys Val Ser Lys lie Phe. lie Thr, His Leu 
50 55 60 ' 

i t 

His Gly Asp His lie Phe Gly Leu Pro Gly Phe Leu Ser Ser Arg Ala • 
5' 65 70 75 -80 ' ' 

. i it 

Phe Gin Ala Asn Glu Glu Gin Thr Asp Leu Glu lie Tyr -Gly Pro Gin 

85 90 95 

i 

t 

10 Gly lie Lys Ser Phe Val Leu Thr Ser Leu Arg Val Ser Gly Ser Arg 

100 105 110 



15 



30 



45 



Leu Pro Tyr Arg He His Phe His Glu Phe Asp Gin Asp Ser Leu Gly 
115 120 , 125 

Lys lie Leu Glu lie Asp Lys Phe Thr Val Tyr Ala Glu Glu Leu Asp 

130 135 140 I 



His Thr lie Phe Cys Val Gly Tyr Arg Val Met Gin Lys Asp Leu Glu 
20 145 150 155 160 

.Gly Thr Leu Asp Ala Glu Lys Leu Lys Ala Ala Gly Val Pro Phe Gly 

165 170 175 

i ■ 

25 Pro Leu Phe Gly Lys lie. Lys Asn Gly Gin Asp Leu Val Leu Glu Asp 

180 185 190' 



Gly Thr Glu lie Lys Ala Ala Asp Tyr lie Ser Ala Pro Arg Pro Gly 
195 200 205 

Lys lie lie Thr lie Leu Gly Asp Thr Arg Lys Thr Asp Ala Ser Val 
210 215 220 



Arg Leu Ala Val Asn Ala Asp Val Leu Val His Glu Ser Thr Tyr Gly 
35 225 230 235 240 

Lys Gly Asp Glu Lys lie Ala Arg Asn His Gly His Ser Thr Asn Met 

245 250 255 

40 Gin Ala Ala Gin Val Ala Val Glu Ala Gly Ala Lys Arg Leu Leu Leu 

260 265 270 



Asn His He Ser Ala Arg Phe Leu Ser Lys Asp He Ser Lys Leu Lys 
275 280 285 

Lys Asp Ala Ala Thr He Phe Glu Asn Val His Val Val .Lys Asp Leu 

290 295 300 



Glu Glu Val Glu He 
50 305 



<2.1'0> 130 
<211> 553 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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25 



40 
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■ • 

<400> 130 

Met Ser Asn lie Ser Leu'.Thr Thr Leu Gly Gly Val Arg Glu Asn Gly 
1 f 5 10 15 

Lys Asn Met Tyr ITe Ala Glu lie Gly Glu Ser lie Phe Val Leu Asn 

20 25 , 30 ' 

t 

Val Gly Leu Lys Tyr Pro Glu Asn Glu Gin Leu Gly Val Asp Val Val 
35 40 . 45 



lie Pro Asn Met Asp Tyr Leu Phe Glu Asn Ser -Asp Arg lie' Ala Gly 
50 55 60 

Val Phe Leu Thr His Gly His Ala Asp Ala lie Giy Ala Leu Pro Tyr 
15 65 i 70 75' 80 

Leu Leu Ala Glu Ala Lys Val Pro Val Phe Gly Ser Glu Leu Thr lie 

85 , 90 95 

20 Glu Leu Ala Lys Leu Phe Val Lys Gly Asn Asp Ala Val Lys Lys Phe 

100 105 . 110 



Asn Asp Phe His Val lie Asp Glu Asn Thr Glu lie Asp Phe Gly Gly 
115 120 125 

Thr Val Val Ser Phe Phe Pro Thr Thr Tyr Ser Val Pro Glu Ser Leu 
130 135 140 



Gly lie Val Leu Lys Thr Ser Glu Gly Ser lie Val Tyr Thr Gly Asp 

30 145 150 155 160 

Phe Lys Phe Asp Gin Thr Ala Ser Glu Ser Tyr Ala Thr Asp Phe Ala 

165 170 175 

35 Arg Leu Ala Glu lie Gly Arg Asp Gly Val Leu Ala Leu Leu Ser Asp 

180 185 190 



Ser Ala Asn Ala Asp Ser Asn lie Gin Val Ala Ser Glu Ser Glu Val 
195 200 205 

Arg Asp Glu lie Thr Gin Thr lie Ala Asp Trp Glu Gly Arg lie lie 
210 215 220 



Val Ala Ala Val Ser Ser Asn Leu Ser Arg lie Gin Gin lie. Phe Asp 
45 225 230 235 240 

Ala Ala Asp Lys Thr Gly Arg Arg lie Val Leu Thr Gly Phe Asp lie 

245 250 255 

50 Glu Asn lie Val Arg Thr Ala lie Arg Leu Lys Lys Leu Ser Leu Ala 

260 265 270 



Asn Glu lie Leu Leu lie Lys Pro Lys Asp Met Ser Arg Phe Glu Asp 
275 280 285 

His Glu Leu lie lie Leu Glu Thr Gly Arg Met Gly Glu Pro lie Asn 
290 295 300 
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Gly Leu Arg Lys Met Ser lie Gly Arg His Arg Tyr Val Glu He Lys 
305 310 315 320 



i < 



Asp Gly Asp Leu Val Tyr He Ala Thr Ala Pro Ser He Ala Lys Glu 

325 330 ' . - 335 



10 



15 



Ala Phe Val Ala Arg Val Glu Asn Met He" Tyr Gin Ala Gly Gly Val 

340 ■ 345 350 

Val Lys Leu He Thr Gin Ser Leu His Val Ser Gly His Gly Asn Val 
355 360 365 



Arg Asp Leu Gin Leu Met He Ash Leu Leu Gin Pro Lys Tyr Leu Phe 
370 375 ! 380 



20 



Pro Val Gin Gly Glu Tyr Arg Glu Leu Asp Ala His Ala Lys Ala Ala' 
385 390 ! 395 400 

Met Ala Val Gly Met Leu Pro Glu Arg He Phe He Pro Lys Lys Gly 

405 410 415 



25 



Thr Thr Met Ala Tyr Glu Asn Gly Asp Phe Val Pro Ala Gly Ser Val 

420 j 425 430 

i 

Ser Ala Gly Asp He Leu lie Asp Gly Asn Ala He Gly Asp Val Gly 
435 440 445 



Asn Val Val Leu Arg Asp Arg Lys Val Leu Ser Glu Asp Gly He Phe 
30 450 455 460 

He Val Ala He Thr Val Asn Arg Arg Glu Lys Lys He Val Ala Arg 
465 470 475 480 

* • 

35 Ala Arg Val His Thr Arg Gly Phe Val Tyr Leu Lys Lys Ser Arg Asp 

485 ■ 490 495 



40 



He Leu Arg Glu Ser Ser Glu Leu He Asn Gin Thr Val Glu Asp Tyr 

500 505 510 

Leu Gin Gly Asp Asp Phe Asp Trp Ala Asp Leu Lys Gly Lys Val Arg 
515 520 525 



Asp Asn Leu Thr Lys Tyr Leu Phe Asp Gin Thr Lys Arg Arg Pro -Ala 
45 530 535- 540 



50 



55 



He Leu Pro Val Val Met Glu Ala Lys 
545 550 



<210> 131 
<211> 316 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 131 

Met Thr Lys Glu Phe His His Val Thr Val Leu Leu His Glu Thr He 
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10 



15 



Asp Met Leu Asp Val Lys Pro Asp Gly lie Tyr Val Asp Ala Thr Leu 

20. 25 . ' ' 30 

5 

Gly Gly Ala Gly His Ser Glu Tyr Leu Leu Ser Lys 'Leu Ser Glu Lys< 
'35 40 45 

Gly His Leu Tyr Ala Phe Asp Gin Asp Gin Asn Ala lie Asp Asn Ala 
10 50 55 , 1 60 1 

Gin Lys Arg Leu Ala Pro Tyr Ilje Glu Lys Gly Val Val Thr Phe lie 1 
65 , 70 I 75 , ,80 

* 

15 Lys Asp Asn Phe Arg His Leu Gin Ala Arg Leu Arg Glu Ala Gly Val 

85 90 95 . , 



20 



i * 



Gin Glu lie Asp Gly lie Cys Tyr Asp Leu Gly Val Ser Ser Pro Gin 

100 105 110 

Leu Asp Gin Arg Glu Arg Gly Phe Ser Tyr Lys Lys Asp Ala Pro Leu 
115 ' 120 125 



. Asp Met Arg Met Asn Gin Asp Ala Ser Leu Thr Ala Tyr Glu Val Val 
25 130 135 ■ 140 

Asn His Tyr Asp Tyr His Asp Leu Val Arg lie Phe Phe Lys Tyr Gly 
145 150 155 160 

30 Glu Asp Lys Phe Ser Lys Gin lie Ala Arg Lys lie Glu Gin Ala Arg 

165 170 175 



35 



Glu Val Lys Pro lie Glu Thr Thr Thr Glu Leu Ala Glu lie lie Lys 

180 185 190 

Leu Val Ly§ Pro Ala Lys Glu Leu Lys Lys Lys Gly His Pro Ala Lys 
195 200 205 



Gin lie Phe Gin Ala lie Arg lie Glu Val Asn Asp Glu Leu Gly Ala 
40 ' 210 215 220 

Ala Asp Glu Ser lie Gin Gin Ala Met Asp Met Leu Ala Leu Asp Gly 
225 230 235 240 

45 Arg lie Ser Val lie Thr Phe His Ser Leu Glu Asp Arg Leu Thr Lys 

245 250 255 



50 



Gin Leu Phe Lys Glu Ala Ser Thr Val Glu Val Pro Lys Gly Leu Pro 

260 265 270 

Phe lie Pro Asp Asp Leu Lys Pro Lys Met Glu Leu Val Ser Arg Lys 

275 280 285 



Pro lie Leu Pro Ser Ala Glu Glu Leu Glu Ala Asn Asn Arg Ser His 
55 290 295 300 



Ser Ala Lys Leu Arg Val Val Arg Lys lie His Lys 
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305 310 315 



<210> 132- 1 ! , 1 V ' • 

<211> 332 ' 1 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 132 

Met Ser Arg lie Leu Asp'Asn Glu lie Met Gly Asp Glu Glu Leu Val 
1 5 1 10 15 

i ! + i 

Glu Arg Thr Leu Arg Pro Gin Tyr Leu Arg Glu Tyr lie Gly Gin Asp 

1 20 ! 25 , 30 

' i 

Lys Val Lys Asp Gin Leu Gin lie Phe. lie Glu Ala Ala Lys Met Arg 
35 40 i 45 

Asp Glu Ala Leu Asp His Val Leu Leu Phe Gly Pro Pro Gly Leu Gly 
50 '55 ,60 

* 

Lys Thr Thr Met Ala Phe Val He Ala Asn Glu Leu Gly Val Asn Leu 
65 70 75 80 

Lys Gin Thr Ser Gly Pro Val He Glu Lys Ala Gly Asp Leu Val Ala 

.85 90 95 

He Leu Asn Glu Leu Glu Pro Gly Asp Val Leu Phe He Asp Glu He 

100 105 110 

His Arg Leu Pro Met Ser Val Glu Glu Val Leu Tyr Ser Ala Met Glu 
115 120 125 

Asp Phe Tyr He Asp lie Met He Gly Ala Gly Glu Gly Ser Arg Ser 
130 135 140 

Val His Leu Glu Leu Pro Pro Phe Thr Leu He Gly Ala Thr Thr Arg 
145 150 155 160 

■ 

Ala Gly Met Leu Ser Asn Pro Leu Arg Ala Arg Phe Gly He Thr Gly 

165 170 .175 

His Met Glu Tyr Tyr Ala His Ala Asp Leu Thr . Glu He Val Glu Arg 

180 185 190 

Thr- Ala Asp He Phe Glu Met Glu He Thr His Glu Ala Ala Ser Glu 
195 200 205 

Leu Ala Leu Arg Ser Arg Gly Thr Pro Arg He Ala Asn Arg Leu Leu 
210 215 220 

Lys Arg Val Arg Asp Phe Ala Gin He Met Gly Asn Gly Val He Asp 
225 230 235 240 

Asp He He Thr Asp Lys Ala Leu Thr Met Leu Asp Val Asp His Glu 

245 250 '255 
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40 
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Gly Leu Asp Tyr Val Asp Gin Lys lie Leu Arg Thr Met lie Glu Met 

260 \ ' 265 270 ' 

i 

Tyr Ser Gly GlyProiVal Gly Leu Gly Thr Leu Ser' Val Asri lie Ala 

275 ' ' 280 1 ' \ 285 

Glu Glu Arg Glu Thr Val Glu Asp Met Tyr Glu Pro Tyr Leu lie Gin 
290 295 300 

Lys Gly Phe lie Met ) Arg 'Thr Arg Ser , Gly Arg Val Ala Thr Ala Lys 
305 ' 310 315 , . 320 

: ' * 1 > , 

i 1 

Ala Tyr Glu His Leu Gly Tyr Glu Tyr Ser Glu Lys , 

' ' 325 ' 330, 



<210> 133 
<211> 436 
<212> PRT 
20 <213> Streptococcus pneumoniae 



<400> 133 ( 

Met Ser Met Phe Leu Asp Thr Ala Lys lie Lys Val Lys Ala Gly Asn 
1 5 . =10 15 

i 

Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu Lys Tyr Val Pro Asn 

20 25 30 



Gly Gly Pro Trp Gly Gly Asp Gly Gly Arg Gly Gly Asn Val Val Phe 

30 35 40 45 

Val Val Asp Glu Gly Leu Arg Thr Leu Met Asp Phe Arg Tyr Asn Arg 

50 55 60 

35 His Phe Lys Ala Asp Ser Gly' Glu Lys Gly Met Thr Lys Gly Met His 

.65 70 75 80 



Gly Arg Gly Ala Glu Asp Leu Arg Val Arg Val Ser Gin Gly Thr Thr 

85 90 95 

Val Arg Asp Ala Glu Thr Gly Lys Val Leu Thr Asp Leu lie Lys His 

100 105 110 



Gly Gin Glu Phe lie Val Ala His Gly Gly Arg Gly Gly Arg Gly Asn 

45 115 120 125 

lie Arg Phe Ala Thr Pro Lys Asn Pro Ala Pro Glu lie Ser Glu Asn 
130 135 140 

50 Gly Glu Pro Gly Gin Glu Arg Glu Leu Gin Leu Glu Leu Lys lie Leu 

145 150 155 160 



Ala Asp Val Gly Leu Val Gly Phe Pro Ser Val Gly Lys Ser Thr Leu 

165 170 175 

Leu Ser Val lie Thr Ser Ala Lys Pro Lys lie Gly Ala Tyr His Phe 

180 185 190 
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10 



25 



40 



50 



55 



Thr Thr lie Val Pro Asn Leu Gly Met Val Arg Thr Gin Ser Gly Glu 
195 200 205 

Ser Phe Ala Val Ala Asp Leu Pro Gly Leu lie Glu Gly Ala Ser Gin 

210 ' 215 i 220, - , 

* i * 

i 

Gly Val Gly Leu Gly Thr Gin Phe Leu Arg His lie Glu Arg Thr Arg 
225 230 235 240 

Val lie Leu His lie lie Asp Met Ser Ala Ser Glu Gly Arg Asp Pro 
. , 245 i 250 255 



Tyr Glu Asp Tyr Leu Ala lie Ash Lys Glu Leu Glu Ser Tyr Asn Leu 
15 , 260 i 265 270 , 

I 

Arg Leu Met Glu Arg Pro Gin lie lie Val Ala Asn Lys Met Asp Met! 
275 280 285 

i 

20 Pro Glu Ser Gin Glu Asn Leu Glu Glu Phe Lys Lys Lys Leu Ala Glu 
290 295 ! 300 



Asn Tyr Asp Glu Phe Glu Glu Leu Pro Ala lie Phe Pro lie Ser Gly 
305 310 | 315 320 

Leu Thr Lys Glri Gly Leu Ala Thr Leu Leu Asp Ala Thr Ala Glu Leu 

325 330 335 



Leu Asp Lys Thr Pro Glu Phe Leu Leu Tyr Asp Glu Ser Asp Met Glu 
30 340 345 350 

Glu Glu Ala Tyr Tyr Gly Phe Asp Glu Glu Glu Lys Ala Phe Glu lie 
355 360 365 

35 Ser Arg Asp Asp Asp Ala Thr Trp Val Leu Ser Gly Glu Lys Leu Met 
370 375 380 



Lys Leu Phe Asn Met Thr Asn Phe Asp Arg Asp Glu Ser Val Met Lys 
385 390 395 ' 400 

Phe Ala Arg Gin Leu Arg Gly Met Gly Val Asp Glu Ala Leu Arg Ala 

405 * 410 415 



Arg Gly Ala Lys Asp Gly Asp Leu Val Arg lie Gly Lys Phe Glu Phe 
45 '420 425 430 



Glu Phe Val Asp 
435 



<210> 134 
<211> 172 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 134 

Met Asn Tyr Phe, Asn Val Gly Lys lie Val Asn Thr Gin Gly Leu Gin 
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i 

1 5 10 15 

Gly Glu Met Arg Val Leu Ser Val Thr Asp Phe Ala Glu Glu Arg Phe 

. ' 20 25 * . , 30 ' ■ . " ' . 

Lys Lys Gly Ala Glu Leu Ala , Leu Phe Asp Glu Lys Asp Gin Phe- Val ( 

35 40 45 

Gin Thr Val Thr lie Ala Ser His Arg Lys Gin Lys Asn Phe Asp lie 

10 50 55 , 60 

lie Lys Phe Lys Asp Met Tyr Hi's lie Asn "Thr lie Glu Lys Tyr LyS' ( 



20 



45 



65 70 



75 . 80 



15 Gly Tyr Ser Leu Lys Val Ala Glu Glu Asp Leu Asn Asp Leu Asp Asp 

.85 90 95 



Gly Glu Phe Tyr Tyr His Glu He He Gly Leu Glu Val Tyr Glu Gly 

100 105 110 

Asp Ser Leu Val Gly Thr He Lys Glu He Leu Gin Pro Gly Ala Asn 
115 1 120 125 



Asp Val Trp Val Val Lys Arg LyS Gly Lys Arg Asp lieu Leu Leu Pro 
25 130 135 ! " 140 

Tyr He Pro Pro Val Val Leu Asn Val Asp He Pro Asn Lys Arg Val 
145 150 . 155 • 160 

30 Asp Val Glu He Leu Glu Gly Leu Asp Asp Glu Asp 

165 170 



<210> 135 
35 <211> 239 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 135 

40 Met Lys He Asp He Leu Thr Leu Phe Pro Glu Met Phe Ser Pro Leu 
1 5 10 15 



Glu His Ser He Val Gly Lys Ala Arg Glu Lys Gly Leu Leu Asp He 

20 25 30 

Gin Tyr His Asn Phe Arg Glu Asn Ala Glu Lys Ala Arg His Val Asp 
35 40 45 



Asp Glu Pro Tyr Gly Gly Gly Gin Gly Met Leu Leu Arg Ala Gin Pro 
50 50 55 60 

He Phe Asp Ser Phe Asp Ala He Glu Lys Lys Asn Pro Arg Val He 
65 70 -75 80 

55 Leu Leu Asp Pro Ala Gly Lys Gin Phe Asp Gin Ala Tyr Ala Glu Asp 

85 90 95 
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i 

Leu Ala Gin Glu Glu Glu Leu lie Phe lie Cys Gly His Tyr Glu Gly 

100 1 1 105 110 ' * 

i 

Tyr Asp Glu Arg lie t Lys Thr Leu Vai Thr 1 Asp Glu< lie Ser Leu Gly 

115 . 1 120 i ■ , 125 1 



■ t 



Asp Tyr* Val Leu Thr Gly Gly Glu Leu Ala Ala Met Thr Met lie Asp 
130 ' 135 140 

Ala Thr Val Arg Leu lie 'Pro Glu Val lie Gly Lys Glu Ser Ser His 
145 1 150 ' 155 ■ i 160 

Gin Asp Asp Ser . Phe Ser Ser Gly Lei} Leii Glu Tyr His Gin Tyr Thr 

' 1 165 i " 170, 175 

i 

i 

Arg Pro Tyr Asp Tyr Arg Gly Met Val Val Pro Asp Val Leu Met Ser 

180 185 190 

Gly His His Glu Lys lie Arg Gin Trp Arg Leu Tyr Glu Ser Leu Lys 
195 200 , 205 

* 

Lys Thr Tyr Glu Arg Arg Pro Asp Leu Leu Glu His Tyr 1 Gin Leu Thr 
210 215 220 

Val Glu Glu Glu Lys Met Leu Ala Glu lie Lys Glu Asn Lys Glu 
225 230 235 



<210> 136 
<211> 186 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 136 

Met lie Glu Ala Ser Lys Leu Lys Ala Gly Met Thr Phe Glu Thr Ala 
15 10 15 



Asp Gly Lys Leu 

20 

Lys Gly Asn Thr 
35 

Ser Thr Phe Asp 
50 

lie He Glu Thr 
65 

Ala Tyr Phe Met 



Val Asn Val Glu 



He Arg Val Leu 



He Met Arg Met 

40 

Thr Ser Tyr Arg 
55 

Val Pro Ala Gin 
70 

Asn Thr Glu Thr 
85 

Asn Glu Leu Leu 



Glu Ala Ser His 
25 

Lys Leu Arg Asp 



Pro Glu Glu Lys 

60 

Tyr Leu Tyr Lys 
75 

Tyr Asp Gin Tyr 
90 

Tyr He Leu Glu 



His Lys Pro Gly 
30 

Val Arg Thr Gly 
45 

Phe Glu Gin Ala 



Met Asp Asp Thr 

80 

Glu He Pro Val 
95 

Asn Ser Asp Val 



74 



WO 01/49721 



PCT/US00/35604 



100 



105 



110 



10 



Lys lie Gin' Phe Tyr Gly Thr Glu Val lie Gly Val Thr Val Pro Thr 

, 115 _ 120 125 ' ( 

Thr Val Glu Leu Thr Val Ala Glu Thr, Gin Pro' Ser lie Lys Gly Ala, • 
130 135 140 

* 

Thr Val Thr Gly Ser Gly Lys Pro Ala Thr Met Glu Thr Gly Leu Val 
145 150 ' , 155' . 160 



Val Asn Val Pro Asp, Phe lie Gl 

, 165* 



ji Ala Gly Gin Lys Leu Val lie Asn 
170 ' 175 



15 Thr Al'a Glu Gly Thr Tyr Val Ser Arg Ala 

180 ; 185 



i * 



20 <210> 137 
<211> 523 
<212> PRT 

<213> Streptococcus pneumoniae, 

! . 

25 <400> 137 

Met Ala Phe Glu Ser Leu Thr Glu Arg Leu Gin Asn Val Phe Lys Asn 
1 5 10 15 



Leu Arg Lys Lys Gly Lys lie Ser Glu Ser Asp Val Gin Glu Ala Thr 

30 20. 25 30 

Lys Glu lie Arg Leu Ala Leu Leu Glu Ala Asp Val Ala Leu Pro Val 

35 40 45 

35 Val Lys Asp Phe lie Lys Lys Val Arg Glu Arg Ala Val Gly His Glu 

50 55 60 



40 



45 



Val lie Asp Thr Leu Asn Pro Ala Gin Gin lie lie Lys lie Val Asp 
65 70 75 80 

Glu Glu Leu Thr Ala Val' Leu Gly Ser Asp Thr Ala Glu lie lie Lys 

85 90 95 

Ser Pro Lys lie Pro Thr lie lie Met Met Val Gly Leu Gin Gly Ala 

100 105 110 



Gly Lys Thr Thr Phe Ala Gly Lys Leu Ala Asn Lys Leu Lys Lys Glu 
115 120 125 

50 Glu Asn Ala Arg Pro Leu Met lie Ala Ala Asp lie Tyr Arg Pro Ala 
130 135 140 



55 



Ala lie Asp Gin Leu Lys Thr Leu Gly Gin Gin lie Asp Val Pro Val 
145 150 155. 160 

Phe Ala Leu Gly Thr Glu Val Pro Ala Val Glu He Val Arg Gin Gly 

165 170 175 



75 
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Leu Glu Gin Ala Gin Thr Asn His Asn Asp Tyr Val Leu lie Asp Thr 

180 185 190 



1 ' .(.. , 



Ala Gly Arg Leu Gin lie Asp Glu Leu Leu Met £sn Glu Leu Arg Asp 

195 (! , . 200 , ' 205 

. > i 't 

i * 

i 

Val Lys Thr Leu Ala Gin Pro Asn Glu lie Leu Leu' Val Val Asp Ala 
210 215 220 . 

I 1 . r . . 

Met lie Gly Gin Glu 1 Ala' Ala Asn Val' Ala Arg Glu Phe Asn Ala Gin 
225 230 235' , ' 240 

Leu Glu Val' Thr Gly Val lie Leu Thr Lys lie Asp Gly .Asp Thr Arg 

245 250 255 

i 

Gly Gly Ala Ala Leu Ser Val Arg His lie Thr Gly Lys Pro lie Lys 

260 265 ' 270 

i * 

Phe Thr Gly "Thr Gly Glu Lys lie Thr Asp lie Glu Thr Phe His Pro 
275 280 285 

■ i • 

Asp Arg Met Ser Ser Arg lie Leu Gly Met Gly Asp Met Leu Thr Leu 
290 295 300 

lie Glu Lys Ala Ser Gin Glu ,Tyr Asp Glu Gin Lys Ala Leu Glu Met 
305 310 315 320 

Ala Glu Lys Met Arg Glu Asn Thr Phe Asp Phe Asn Asp Phe lie Asp 

325 330 335 

* 

Gin Leu Asp Gin Val Gin Asn Met Gly Pro Met Glu Asp Leu Leu Lys 

340 345 350 

Met lie Pro Gly Met Ala Asn Asn Pro Ala Leu Gin Asn Met Lys Val 
355 360 365 

Asp Glu Arg Gin lie Ala Arg Lys Arg Ala lie Val Ser Ser Met Thr 
370 375 380 

Pro Glu Glu Arg Glu Asn Pro Asp Leu Leu Asn Pro Ser Arg Arg Arg 
385 390 395 400 

Arg lie Ala Ala Gly Ser Gly Asn Thr Phe Val Glu Val Asn Lys Phe 

405 410 415 

lie Lys Asp Phe Asn Gin Ala Lys Gin Leu Met Gin Gly Val Met Ser 

420 425 430 

* * 

Gly Asp Met Asn Lys Met Met Lys Gin Met Gly lie Asn Pro Asn Asn 
435 440 445 

Leu Pro Lys Asn Met Pro Asn Met Gly Gly Met Asp Met Ser Ala Leu 
450 455 460 

Glu Gly Met Met Gly Gin Gly Gly Met Pro Asp Leu Ser Ala Leu Gly 
465 470 475 480 
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10 



20 



35 



50 



Gly Ala Gly Met Pro Asp' Met Ser Gin Met Phe Gly Gly Gly Leu Lys 

485 490 495 



! 1 ■ . 



Gly Lys lie Gly Glu Phe Ala Met Lys Gin Ser Me;t Lys Afg Met Ala 

500, 50,5 510 

i * 

i ' 
Asn Lys Met Lys Lys Ala Lys Lys Lys Arg Lys 

515 520 



<210> 138 

<211> 281 \ . i 

<212i> PRT ' ' 
15 <213> Streptococcus pneumoniae 



<400> 138 

Met Tyr Leu lie Glu lie Leu Lys Ser lie Phe Phe Gly lie Val Glu 
1 5 10 : ' 15 

Gly lie Thr Glu Trp Leu Pro lie Ser Ser Thr Gly His Leu lie Leu 

1 20 25 ' 30 



Ala Glu Giu> Phe lie Uin Tyr Gin Asn Gin Asn biu Ala Phe Met Ser 

25 35 40 45 

Met Phe Asn Val Val lie Gin Leu Gly Ala lie Leu Ala Val Met Val 

50 55 60 

30 lie Tyr Phe Asn Lys Leu Asn. Pro Phe Lys Pro' Thr Lys Asp Lys Gin 

65 70 75 80 



Glu Val Arg Lys Thr Trp Arg Leu Trp Leu Lys Val Leu lie Ala Thr 

85 90 95 

Leu Pro Leu Leu Gly Val Phe Lys Phe Asp Asp Trp Phe Asp Thr His 

100 105 110 



Phe His Asn Met Val Ser Val Ala Leu Met Leu lie lie Tyr Gly Val 
40 ' 115 120 125 

Ala Phe lie Tyr Leu Glu Lys Arg Asn Lys Ala Arg Ala lie Glu Pro 

130 135 140 

45 Ser Val Thr Glu Leu Asp Lys Leu Pro Tyr Thr Thr Ala Phe Tyr lie 

145 150 155 160 



Gly Leu Phe Gin Val Leu Ala Leu Leu Pro Gly Thr Ser Arg Ser Gly 

165 170 175 

* 

Ala Thr He Val Gly Gly Leu Leu Asn Gly Thr Ser Arg Ser Val Val 

180 185 190 



Thr Glu Phe Thr Phe Tyr Leu Gly He Pro Val Met Phe Gly Ala Ser 
55 195 200 205 

Ala L§u Lys He Phe Lys Phe Val Lys Ala Gly Glu Leu Leu Ser Phe 
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210 215 . 1 220 

Gly Gin Leu Phe Leu Leu Leu Val Ala Met Gly Val Ala Phe Ala Val 
225 230 235 240 

Ser Met Val Ala lie Arg Phe Leu Thr Ser Tyr Val.Lys Lys His, Asp 

245 250 255 

Phe Thr Leu Phe Gly Lys Tyr Arg lie Val Leu Gly Ser Val Leu Leu 

260 265 270 

Leu Tyr Ser Phe Val Arg Leu Phe Val 

275 280 

i 

t 
i 

i < 

<210> 139 

<211> 429 I t ■ 

<212> PRT 

<213> Streptococcus pneumoniae 

* 

<400> 139 

Met Gly Leu Phe Asp Arg Leu Phe Gly Lys Lys Glu Glu Pro Lys lie 
1 5 10 15 

i * 

Glu Glu Val Val Lys Glu Ala Leu Glu Asn Leu Asp Leu Ser Glu Asp 

20 25 30 

lie Glu Pro Ala Phe Thr Glu Ala Glu Glu Val Ser Gin Glu Glu Ala 
35 40 45 

Glu Val Glu Ser Ser Glu Glu Ser Val Phe Gin Glu Glu Asp Ser Gin 
50 55 60 

Asp Thr Val Glu Glu Asn Leu Asp Leu Glu Pro Val Val Glu Val Ser 
65 70 75 80 

Gin Glu Glu Val Glu Glu Phe Pro Asn Ser Gin Glu Val Thr Glu Glu 

85 90 95 

Glu Lys Leu Glu His Glu Gly' Thr Val Glu Glu Asn Asn Phe Glu Val 

100 , 105 110 

Leu Glu Pro Glu Ala Pro Gin Thr Glu Glu Thr Val Gin Glu Lys Tyr 
115 120 125 

Asp Arg Ser Leu Lys Lys Thr Arg Thr Gly Phe Gly Ala Arg Leu Asn 
130 135 140 

Ala Phe Phe Ala Asn Phe Arg Ser Val Asp Glu Glu Phe ' Phe Glu Glu 
145 150 155 160 

Leu Glu Glu Leu Leu lie Met Ser Asp Val Gly Val Gin Val Ala Ser 

165 170 175 

Asn Leu Thr Glu Glu Leu Arg Tyr Glu Ala Lys Leu Glu Asn Ala Lys 

180 185 190 
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Lys Pro Asp Ala Leu Arg Arg Val II lie Glu Lys Leu Val Glu Leu 
195 200 205 



Tyr Glu Lys Asp. Gly Ser Tyr Asp Glu Ser lie His Phe Gin Asp Ash 
210 215 220 



10 



15 



Leu Thr Val Met Leu Phe Val Gly Val Asn Gly Val Gly Lys Thr Thr 
225 ' 230 235 240 

Ser lie Gly Lys Leu Ala His Arg Tyr Lys Arg Ala Gly Lys Lys Val 

245 250 255 

Met Leu Val Ala, Ala Asp Thr Phe Arg Ala Gly Ala Val Ala Gin' Leu 



260 



265 



270 



Ala Glu Trp Gly Arg Arg Val Asp Val Pro Val Val Thr Gly Pro Glu 
275 280 285 f 



Lys Ala Asp Pro Ala Ser Val Val Phe Asp Gly Met Glu Arg Ala Val 
20 290 295 300 

Ala Glu Gly lie Asp lie Leu Met lie Asp Thr Ala Gly Arg Leu Gin 

305 . 310 315 320 

i 

25 Asn Lys Asp Asn Leu Met Ala Glu Leu Glu Lys lie Gly Arg lie lie 

325 330 335 



30 



Lys Arg Val Val Pro Glu Ala Pro. His Glu Thr Phe Leu Ala Leu Asp 

340 . 345 350 

Ala Ser Thr Gly Gin Asn Ala Leu Val Gin Ala Lys Glu Phe Ser Lys 
355 360 365 



lie Thr Pro Leu Thr Gly lie Val Leu Thr Lys lie Asp Gly Thr Ala 
35 370 375 380 

Arg Gly Gly Val Val Leu Ala lie Arg Glu Glu Leu Asn lie Pro Val 
385 * 390 395 400 

40 Lys Leu lie Gly Phe Gly Glu Lys lie Asp Asp lie Gly Glu. Phe Asn 

405 410 415 



45 



Ser Glu Asn Phe Met Lys Gly Leu Leu Glu Gly Leu lie 

420 425 



50 



<210> 140 
<211> 165 
<212> PRT 

<213> Streptococcus pneumoniae 



55 



<400> 140 

Met Tyr lie Glu Met Val Asp Glu Thr Gly Gin Val Ser Lys Glu Met 
15 10 15 

Leu Gin Gin Thr Gin Glu lie Leu Glu Phe Ala Ala Lys Lys Leu Gly 

20 . 25 30 
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Lys Glu Asp Lys Glu M t 1 Ala Val Thr Phe Val ^hr Asn Glu Arg Ser 

35 40 45 

1 i , .i .... 

' i . ... 

His Glu Leu Asn Leu Glu Tyr Arg Asp Thr Asp £rg Pro Thr Asp Val 

50 55 , 60 v 

I 9 I | h t 

1 • 1 

lie Ser Leu Glu Tyr Lys Pro Glu Leu Glu He Ala Phe Asp Glu Glu 
65 70 75 80 

Asp Leu Leu Glu Asn 'Pro Glu Leu Ala Glu Met Met Ser Glu • Phe Asp 

85 90' , ■ 95 

Ala Tyr He Gly Glu Leu Phe He Ser He, Asp Lys .Ala His Glu Gin 

100 ( 105 \ 110 

r I 

Ala Glu Glu Tyr Gly His Ser Phe Glu Arg Glu Met Gly Phe Leu Ala 
11*5 120 125 

p ■ 

r 

Val His Gly Phe Leu His He Asn Gly Tyr Asp His Tyr Thr Pro Glu 
130 135 140 

Glu Glu Ala Glu Met Phe Gly Leu Gin Glu Glu He Leu Thr Ala Tyr 
145 150 155 . 160 

Gly Leu Thr Arg Gin 

165 



<210> 141 
<211> 255 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 141 

Met Ser He Arg Val He He Ala Gly Phe Lys Gly Lys Met Gly Gin 
15 10 15 

Ala Ala Cys Gin Met Val Leu Thr Asp Pro Asp Leu Asp Leu Val Ala 

20 25 30. 

Val Leu Asp Pro Phe Glu Ser Glu Ser Glu Trp Gin Gly He Pro Val 
35 - 40 45 

Phe Lys Asp Lys Ala Asp Leu Ala Gly Phe Glu Ala Asp Val Trp Val 
50 55 60 

Asp Phe Thr Thr Pro Ala Val Ala Tyr Glu Asn Thr Arg Phe Ala Leu 
65 70 75 80 

Glu Asn Gly Phe Ala Pro Val Val Gly Thr Thr Gly Phe Thr Ser Glu 

85 90 95 

Glu He Ala Glu Leu Lys Glu Phe Ser Arg Ala Gin Asp Leu Gly Gly 

' 100 105 110 

Leu He Ala Pro Asn Phe Ala Leu Gly Ala Val Leu Leu Met Gin Phe 



.80 
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115 12Q 125 

Ala Thr Gin Ala Ala Lys Tyr Phe Pro Asn Val Glu lie lie Glu Leu 
130 1 , 135 140' 

s ■ ■ ■ ■ . , ' 

His His Asp Lys Lys Lys Asp Ala Pro Ser Gly. Thr Ala lie Lys 1 Thr 
145 , 150 155 160 

Ala Glu Leu Met Ala Glu Val Arg Glu Ser lie Gln ( Gln' Gly Ala Ala 
10 165 1 170 ' 175 

, , ■ . ' , i 
Asp Glu Glu Glu Leu lie Ala Gly Ala Arg Gly Ala Asp Phe. Asp* Gly 

180 185 190 

15 Met Arg He His Ser Val Arg Leu Pro Gly Leu Val Ala His Gin Glu 

195 200 205 



20 



Val He Phe Gly Asn Gin Gly Glu Gly Leu Thr Leu Arg His Asp Ser 
210 215 220 

i 

Tyr Asp Arg He Ser Phe Met Thr Gly Val Asn Leu Gly He Lys Glu 
225 1 230 235 1 240 

w «-*-*■ v « j-ij & i-i-u y wjula j-k=u v a j_ i yi \3JLy jjcu Ol U nib j_it» Q Xje U 

25 245 250 255 



<210> 142 
<211> 91 
30 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 142 

Met Ala Asn Lys Gin Asp Leu He Ala. Lys Val Ala Glu Ala Thr Glu 
35 1 5 10 15 

Leu Thr Lys Lys Asp Ser Ala Ala Ala Val Glu Ala Val Phe Ala Ala 

20 - 25 30 - 

40 Val Ala Asp Tyr Leu Ala Ala Gly Glu Lys Val Gin Leu He Gly Phe 

35 40 45 



45 



Ser Asn Phe Glu Val Arg - Glu Arg Ala Glu Arg Lys Gly Arg Asn Pro 
50 55 60 

Gin Thr Gly Lys Glu Met Thr lie Ala Ala Ser Lys Val Pro Ala Phe- 
65 70 75 80 



Lys Ala Gly Lys Ala Leu Lys Asp Ala Val Lys 
50 . 85 90 



<210> 143 
<211> 306 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 143 

Met Thr Lys Thr Ala Phe Leu Phe Ala Gly Gin Gly Ala Gin Tyr Leu 
15 10 / 15 

Gly Met Gly Arg Asp Phe Tyr Asp Gin Tyr Pro lie Val Lys Glu Thr 

20 25. 1 30 , 



10 



15 



He Asp Arg Ala Ser Gin Val Leu Gly Tyr Asp Leu Arg Tyr Leu He 
35 4P 45 

i ! 

Asp Thr Glu Glu Asp Lys Leu Asn Gin Thr Arg Tyr Thr Gin Pro Ala 



50 



55 



60 



He Leu Ala Thr Ser Val Ala He Tyr Arg Leu Leu Gin Glu Lys Gly 
65 70 I 75 80 



20 



25 



Tyr Gin Pro Asp Met Val Ala Gly Leu Ser Leu Gly Glu Tyr Ser Ala 

85 90 95 

Leu Val Ala Ser Gly Ala Leu Asp Phe Glu Asp Ala Val Ala Leu Val 

100 105 110 

Ala Lys Arg Gly Ala Tyr Met Glu Glu Ala Ala Pro Ala Asp Ser Gly 
115 120 125 

Lys Met Val Ala Val Leu Asn Thr Pro Val Glu Val He Glu Glu Ala 
130 135 140 



i ■ 



Cys Gin Lys Ala Ser Glu Leu Gly Val Val Thr Pro Ala Asn Tyr Asn 
30 145. * 150 155 160 

Thr Pro Ala Gin He Val He Ala Gly Glu Val Val Ala -Val Asp Arg 

165 170 175 



35 Ala Val Glu Leu Leu Gin Glu Ala Gly Ala Lys Arg Leu He Pro Leu 

180 185 190 

Lys Val Ser Gly Pro Phe His Thr Ala Leu Leu Glu Pro Ala Ser Gin 
195 200 205 

Lys Leu Ala Glu Thr Leu Ala Gin Val Ser Phe Ser Asp Phe Thr Cys 
210 ^ 215 220 

Pro Leu Val Gly Asn Thr Glu Ala Ala Val Met Gin Lys Glu Asp He 
45 225 230 235 240 



40 



Ala Gin Leu Leu Thr Arg Gin Val Lys Glu Pro Val Arg Phe Tyr Glu 

245 250. 255 

50 Ser He Gly Val Met. Gin Glu Ala Gly He Ser Asn Phe He Glu He 

260 265 270 



55 



Gly Pro Gly Lys Val Leu Ser Gly Phe Val Lys Lys He Asp Gin Thr 

275 280 285 

Ala His Leu Ala His Val Glu Asp Gin Ala Ser Leu Val Ala Leu Leu 
290 295 300 



j 
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10 



20 



35 



50 



Glu Lys 
305 



<210> 144 ■ t 

<211> 243 
<212> PRT 

<213> Streptococcus • pneumoniae , 

<400> 144 1 ' , , " ' ' , 

Met Lys Leu Glu His Lys Asn lie Phe lie Thr . Gly Ser 'Ser. Arg Gly 
1 5 \ . i 10 15 



15 lie Gly Leu Ala lie Ala His Lys Phe Ala Gin- Ala Gly Ala Asn He 

20 25 30 



Val Leu Asn Ser Arg Gly Ala He Ser Glu Glu Leu Leu Ala Glu Phe 
35 40 45 

t 

Ser Asn Tyr Gly He Lys Val Val Pro He Ser Gly Asp Val Ser Asp 
50 1 55 60 



Phe Ala Asp Ala Lys Arg Met lie Asp Gin Ala He Ala Glu Leu Gly 
25 65 70 75 80 

Ser Val Asp Val Leu Val Asn Asn Ala Gly He Thr Gin Asp Thr Leu 

85 90 95 

i 

30 Met Leu Lys Met Thr Glu. Ala Asp Phe Glu Lys Val Leu Lys Val Asn 

100 105 110 



Leu Thr Gly Ala Phe Asn Met Thr Gin Ser Val Leu Lys Pro Met Met 
115 120 125 

Lys Ala Arg Glu Gly Ala He He Asn Met Ser Ser Val Val Gly Leu 
130 135 140 



Met Gly Asn lie Gly Gin Ala Asn Tyr Ala Ala Ser Lys Ala Gly Leu 
40 145 150 155 160 

He Gly Phe Thr Lys Ser Val Ala Arg Glu Val Ala Ser Arg Asn He 

165 170 175 

45 Arg Val Asn Val He Ala Pro Gly Met He Glu Ser Asp Met Thr Ala 

180 185 190 



He Leu Ser Asp Lys He Lys" Glu Ala Thr Leu Ala Gin He Pro Met 
195 200 205 

Lys Glu Phe Gly Gin Ala Glu Gin Val Ala Asp Leu Thr Val Phe Leu 
210 215 220 



* 

Ala Gly Gin Asp Tyr Leu Thr Gly Gin Val Val Ala He Asp Gly Gly 
55 225 230 235 240 



Leu Ser Met 
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30 



45 



<210> 145 
5 ' <211> 276 

<212> PRT ' , 

<213> Streptococcus pneumoniae . ' 

<400> 145 

10 Met Gly Val Lys Lys Lys Leu Lys Leu Thr Ser Leu Leu Gly Leu Ser 
1 5 ' . 10 15 

■ Leu Leu lie Met Thr Ala Cys Ala Thr Asn Gly Val Thr Ser Asp ■ lie 

20 I ,25 30 

15 ■ , i 

Thr Ala Glu Ser Ala Asp Phe Trp Ser Lys Leu Val Tyr Phe Phe Ala 
35 40 45 t 

Glu lie lie Arg Phe Leu Ser Phe Asp lie. Ser lie Gly Val Gly lie 
20 50 55 60 

lie Leu Phe Thr Val Leu lie Arg Thr Val Leu Leu Pro Val Phe Gin 

65 70 75 80 

j 

25 Val Gin Met Val Ala Ser Arg Ly^ Met Gin Glu Ala Gin Pro Arg lie 

85 90 95 

i 

Lys Ala Leu Arg Glu Gin Tyr Pro Gly Arg Asp Met Glu Ser Arg Thr 

100 105 110 

Lys Leu Glu Gin Glu Met Arg Lys Val Phe Lys Glu Met Gly Val Arg 
115 120 125 



i ■ • 



Gin Ser Asp Ser Leu Trp Pro lie Leu lie Gin Met Pro Val lie Leu 
35 130 135 140 

Ala Leu Phe Gin Ala Leu Ser Arg Val Asp Phe Leu Lys Thr Gly His 
145 150 155 160 

40 Phe Leu Trp lie Asn Leu Gly Ser Val Asp Thr Thr Leu Val Leu Pro 

165 170 175 



lie Leu Ala Ala Val Phe Thr Phe Leu Ser Thr Trp Leu Ser Asn Lys 

180 185 190 

Ala Leu Ser Glu Arg Asn Gly Ala Thr Thr Ala Met Met Tyr Gly lie 
195 200 205 



Pro Val Leu lie Phe lie Phe Ala Val Tyr Ala Pro Gly Gly Val Ala 
50 210 215 220 

Leu Tyr Trp Thr Val Ser Asn Ala Tyr Gin Val Leu Gin Thr Tyr Phe 
225 230 235 240 

55 Leu Asn Asn Pro Phe Lys lie lie Ala Glu Arg Glu Ala Val Val Gin 

245 250 255 
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Ala Gin Lys Asp Leu Glu Asn Arg Lys Arg Lys Ala Lys Lys Lys Ala 

260 265 270 

> Glri Lys Thr Lys 
5 275 



<210> 146 

<211> 409 

10 <212> PRT 

<213> Streptpcoccus pneumoniae 

<400> 146 

Met Lys lie Ser Lys Arg His Le 
15 <1 ' 5 



u Leu Asn Tyr Ser lie Leu lie Pro 
10 1-5 



I 

Tyr Leu Leu Leu Ser lie Leu Gly Leu lie Val Val Tyr Ser Thr Thr 

20 25 30 

20 Ser Ala lie Leu lie Glu Glu Gly Lys Ser Ala Leu Gin Leu Val Arg 

35 40 45 



i ■ 



25 



Asn Gin Gly lie Phe Trp lie Val Ser Leu lie Leu lie Ala Leu lie 

50 55 ! . 60 ' 

Tyr Lys Leu Arg Leu Asp Phe Leu Arg Asn Glu Arg Leu lie lie' Leu 

65 70 75 80 



Val lie Leu lie Glu Met Leu Leu Leu Phe Leu Ala Arg Phe lie Gly 
30 • 85 90 95 

lie Ser Val Asn Gly Ala Tyr Gly Trp lie Ser Val Ala Gly Val Thr 

100 105 110 

35 lie Gin Pro Ala Glu Tyr Leu Lys lie lie lie lie Trp Tyr Leu Ala 

115 120 125 



40 



His Arg Phe Ser Lys Gin Gin Glu Glu lie Ala Thr Tyr Asp Phe Gin 

130 -135 140 

Val Leu Thr Gin Asn Gin Trp Leu Pro Arg Ala Phe Asn Asp Trp Arg 

145 150 155 160 



Phe Val Leu Leu Val Leu lie Gly Ser Leu Gly lie Phe Pro Asp Leu 
45 165 170 175 

Gly Asn Ala Thr lie Leu Val Leu Val Ser Leu He Met Tyr Thr Val 

180 185 190 

50 Ser Gly He Ala Tyr Arg Trp Phe Ser Thr He Leu Ala Leu Val Ser 

195 200 205 



55 



Ala Thr Ser Val Phe Val Leu Thr Thr He Ser Leu He Gly Val Glu 
210 215 220 

* * 

Thr Phe Ser Lys He Pro Val Phe Gly Tyr Val Ala Lys Arg Phe Ser 
225 230 235 240 



85 
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Ala Phe Phe Asn Pro Phe Ala Asp Arg Ala Asp Ala Gly His Gin Leu 

245 ' 250 255 

i 

Ala Asn Ser Tyr Phe Ala Met Val Asn Gly Gly Trp Phe Gly Leu Gly 

260 265 270 

Leu Gly Asn Ser lie Glu Lys Arg Gly Tyr Leu Pro Glu Ala His Thr 
275 280 285 

Asp Phe Val Phe Ser 1 lie Val lie Glu' Glu Phe , Gly Phe Val 1 Gly Ala 
290 295 300 , ' 

Ser Leu lie Leu Ala Leu Leu Phe Phe Met, lie Leu Arg lie lie Leu 
305 , 310 315. 320 

Val Gly lie Arg Ala Glu Asn Pro Phe Asn Ala Met Val Ala Leu Gly 

325 330 335 

Val Gly Gly Met Met Leu Val Gin Val Phe Val Asn lie Gly Gly lie 

340 345 350 

i 

Ser Gly Leu lie Pro Ser Thr Gly Val Thr Phe Pro Phe Leu Ser Gin 
355 360 365 

Gly Gly Asn Ser Leu Leu Val Leu Ser Val Ala Val Ala Phe Val Leu 
370 375 380 

Asn lie Asp Ala Ser Glu Lys Arg Ala Lys Leu Tyr Arg Glu Leu Glu 
385 390 395 400 

Asn Gin Pro Met Asn Leu Leu Leu Lys 

405 



<210> 147 
<211> 419 
<212> PRT 

<213> Streptococcus 
<400> 147 

Met Leu Gly lie Leu 
1 5 

Val His Glu Phe Gly 

20 

* 

Val Arg Glu Phe Ala 
35 

Gly Lys Asp Gly Thr 
50 

Tyr Val Arg Met Ala 
65 

Gly Thr Pro Val Ser 



pneumoniae 



Thr Phe lie Leu Val Phe 

10 

His Phe Tyr Phe Ala Lys 

25 

lie Gly Met Gly Pro Lys 
40 

Ala Tyr Thr lie Arg lie 
55 

Gly Trp Gly Asp Asp Thr 
70 75 

Leu Thr Leu Ala Asp Asp 



Gly lie lie Val Val 

15 

Lys Ser Gly lie Leu 
30 

•He Phe Ala His He 
45 ■ 

Leu Pro Leu Gly Gly 
60 

Thr Glu He. Lys Thr 

■ 80 

Gly Lys Val Lys Arg 
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85 90 95. 

lie Asn Leu Ser Gly Lys Lys Leu Asp Gin Thr Aia Leu Pro Met Gin 

100 ' ! f 105 ' ' " ' 110 

5 • • • < , : 

Val Thr Gin Phe Asp Phe Glu Asp Lys Leu Phe lie. Lys Gly Leii Val 
115 i 120 ' * 125 

Leu Glu Glu Glu Lys Thr Phe Ala Val Asp His Asp, Ala Thr Val Val 

10 130 ( '135 , ' 140 

,r i 

Glu Ala Asp Gly Thr Glu Val Arg lie Ala Pro. Leu 1 Asp Val Gin Tyr 
145 , t 150 ■ ! 155 160 

15 Gin Asn Ala Thr lie Trp Gly Lys Leu lie Thr 1 Asn Phe Ala Gly Pro 

165 170 175 



20 



35 



50 



Met Asn Asn Phe lie Leu Gly Val Val yal Phe Trp Val Leu lie Phe 

180 185 190 

Met Gin Gly Gly Val Arg Asp Val* Asp Thr Asn Gin Phe His lie Met 
19*5 200 , 205 



Pro Gin Gly Ala Leu Ala Lys Val Gly Val Pro Glu Thr Ala Glh lie 

25 210 , 215 ' 220 

Thr Lys lie Gly Ser His Glu Val Ser Asn Trp Glu Ser Leu lie Gin 

225 230 235 240 

30 Ala Val Glu Thr Glu Thr Lys Asp Lys Thr. Ala Pro Thr Leu Asp Val 

245 250 255 



Thr lie Ser Glu Lys Gly Ser Asp Lys Gin Val Thr Val Thr Pro Glu 

260 265 270 

Asp Ser Gin Gly Arg Tyr Leu Leu Gly Val Gin Pro Gly Val Lys Ser 
275 280 285 



Asp Phe Leu Ser Met Phe Val Gly Gly Phe Thr Thr Ala Ala Asp Ser 
40 290 295 300 

Ala Leu Arg lie Leu Ser Ala Leu Lys Asn Leu lie Phe Gin Pro Asp 
305 310 315 320 

45 Leu Asn Lys Leu Gly Gly Pro Val Ala lie Phe. Lys Ala Ser Ser Asp 

325 330 335 



Ala Ala Lys Asn Gly lie Glu Asn lie Leu Tyr Phe Leu Ala Met lie 

340 345 350 

Ser lie Asn lie Gly lie Phe Asn Leu lie Pro lie Pro Ala Leu Asp 

355 360 365 



Gly Gly Lys lie Val Leu Asn lie Leu Glu Ala lie Arg Arg Lys Pro 
55 370 375 380 

Leu Lys Gin Glu lie Glu Thr Tyr Val Thr Leu Ala Gly Val Val lie 
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385 



390 



395 



400 



Met Val Val Leu Met lie Ala Val Thr Trp Asn Asp lie Met Arg Leu 

405 410 , ' 415 

: ' ' > 

Phe Phe Arg ' . 



<210> 148 
<211> 197 
<212> PRT 

<213> Streptococcus pneumoniae 
<400>,148 j 

Met Tyr Ala Tyr Leu Lys Gly lie lie Thr Lys lie Thr Ala Lys Tyr 

1 .5 10 ^ 15 f 

■ 

i 

lie Val Leu Glu Thr Asn Gly lie Gly Tyr lie Leu His Val Ala Asn 

20 * 25 30 

Pro Tyr Ala Tyr Ser Gly Gin Val Asn Gin Glu Ala Gin lie Tyr Val 
35 40 45 

• ; 

i , 

His Gin Val Val Arg Glu Asp Ala His Leu Leu Tyr Gly Phe Arg Ser 
50 .55 60 

Glu Asp Glu Lys Lys Leu Phe Leu Ser Leu lie Ser Val Ser Gly lie 
65 70 75 80 

Gly Pro Val Ser Ala Leu Ala lie lie Ala Ala Asp Asp Asn Ala Gly 

85 90 ' 95 

Leu Val Gin Ala lie Glu Thr Lys Asn lie Thr Tyr Leu Thr Lys Phe 

100 105 110 

Pro Lys lie Gly Lys Lys Thr Ala Gin Gin Met Val Leu Asp Leu Glu 
115 120 125 

Gly Lys Val Val Val Ala Gly Asp Asp Leu Pro Ala Lys Val Ala* Val 
130 135 140 " 

Gin Ala Ser Ala Glu Asn Glh Glu Leu Glu Glu Ala Met Glu Ala Met 
145 150 155 160 

Leu Ala Leu Gly Tyr Lys Ala Thr Glu Leu Lys Lys lie Lys Lys Phe 

165 170 175 

Phe Glu Gly Thr Thr Asp Thr Ala Glu Asn Tyr lie. Lys Ser Ala Leu 

180 185 190 

Lys Met Leu Val Lys 
195 



<210> 149 
<211> 257 
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10 



25 



40 
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<212> PRT 

<213> Streptococcus pneumoniae 
<400> 149 . ' 

Met Lys Asn Asn Arg lie Leu Ala Leu Ser Gly Asn Asp lie Phe £>er 
1 5 10 ' • ,' 15 • - , 

I 

Gly Gly Gly Leu Ser Aia Asp Leu Ala Thr Tyr Thr Leu Asn Gly Leu. 

20 - 25 30 

1 1 

His Gly Phe Val Ala Val Thr Cys Leu Thr Ala Leu Thr Glu Lys Gly 
35 40 45 



Phe Glu Val Phe Pro Thr Asp Asp Thr He Phe Gin His Glu Leu Asp 

15 50 55 i 60 

l 

Ser Leu Arg Asp Val Glu Phe Gly Gly He Lys He Gly Leu Leu Pro 

65 70 75 80 

20 Thr Val Ser Val Ala Glu Lys Ala Leu Asp Phe He Lys Gin Arg Pro 

85 90 95 



i ■ 



Gly Val Pro Val Val Leu Asp Pro Val Leu Val Cys Lys Glu Thr His 

100 ! 105 110 

Asp Val Ala Val Ser Glu Leu Cys Gin Glu Leu He Arg Phe Phe Pro 
115 120 125 



Tyr Val Ser Val lie Thr Pro Asn Leu Pro Glu Ala Glu Leu Leu Ser 
30 130 135 140 

Gly Gin Glu He Lys Thr Leu Glu Asp Met Lys Thr Ala Ala Gin Lys 
145 150 155 160 

35 Leu His Asp Leu Gly Ala Pro" Ala Val He lie Lys Gly Gly Asn Arg 

165 170 175 



Leu Ser Gin Asp Lys Ala Val Asp Val Phe Tyr -Asp Gly Gin Thr Phe 

180 185 190 

Thr He Leu Glu Asn Pro Val He Gin Gly Gin Asn Ala Gly Ala Gly 
195 - 200 205 



Cys Thr Phe Ala Ser Ser He Ala Ser His Leu Val Lys Gly Asp Lys 

45 210 215 220 

Phe Leu Pro Ala Val Glu Ser Ser Lys Ala Phe Val Tyr Arg Ala He 

225 230 235 240 

50 Ala Gin Ala Asp Gin Tyr Gly Val Arg Gin Tyr Glu Ala Asn Lys Asn 

245 250 255 



Asn 



<210> 150 



89 



INSDOCID: <WO 0149721A2_I_> 



WO 01/49721 PCT/US00/35604 



<211> 412 

<212> PRT ': 
<213> Streptococcus pneumoniae 

■ . i ' . 

<400> 150 1 ' ' ' . ' , ' 

Met lie Glu Thr Glu Lys Lys Glu Glu Arg Vai Leu Leu lie Gly Val 
1 5 10 15 

Glu Leu Gin Gly Met Asp Ser Phe Asp Leu Ser Met. Glu Glu Leu Ala 

20 ■ , 25, 30 

i' , 'i 

Ser Leu Ala Lys Thr Ala Gly Ala Val Val Val Asp Ser Tyr Arg Gin 

35 , 40 '< 1 i .45 

Lys Arg Glu Lys Tyr Asp Ser Lys Thr Phe Val Gly Ser Gly Lys Leu 
.50 55 60 

Glu Glu lie Ala Leu Met yal Asp Ala ,Glu Glu lie Thr Thr Val He 
65 70 75 80 

Val Asn Asn Arg Leu Thr Pro Arg Gin Asn Val Asn Leu Glu Glu Val 

1 85 90 95 

Leu Gly Val Lys Val He Asp Arg Met Gin Leu lie Leu Asp He Phe 

100 , 105 110 

Ala Met Arg Ala Arg Ser His Glu Gly Lys Leu Gin Val His Leu Ala 
115 120 , 125 

i 

Gin Phe Lys Tyr Leu Leu Pro Arg Leu Val Gly Gin Gly He Met Leu 
130 135 140 

K 

Ser Arg Gin Ala Gly Gly He Gly Ser Arg Gly Pro Gly Glu Ser Gin 
145 150 155 160 

Leu Glu Leu Asn Arg Arg Ser Val Arg Asn Gin He Thr Asp He Glu 

165 170 175 

Arg Gin Leu Lys Val Val Glu Lys Asn Arg Ala Thr Val Arg Glu Lys 

180 185 190 

Arg Leu Glu Ser Ser Thr Phe Lys He Gly Leu He Gly Tyr Thr Asn 
195 200 205 

Ala Gly Lys Ser Thr He Met Asn He Leu Thr Ser Lys Thr Gin Tyr 
210 215 220 

Glu Ala Asp Glu Leu Phe Ala Thr Leu Asp Ala Thr Thr Lys Ser He 
225 230 235 . 240 

His Leu Gly Gly Asn Leu Gin Val Thr Leu Thr Asp Thr Val Gly Phe 

245 250 255 

He Gin Asp Leu Pro Thr Glu Leu Val Ser Ser Phe Lys Ser Thr Leu 

260 265 270 

Glu Glu Ser Lys His Val Asp Leu Leu. Val His Val He Asp Ala Ser 



90 
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275 



280 



285 



10 



15 



20 



25 



Asn Pro Tyr His Glu Glu His Glu Lys Thr Val Leu Ser lie Met f,ys 

! 290 ' 295 ' 300 ' / ' 

I , t 

Asp Leu Asp Met Glu Asp lie Pro His Leu Thr Leu Tyr Asn Lys' Ala 1 
305 310 315 320 

Asp Leu Val Glu Asp Phe Thr Pro Thr Gin Thr Pro Tyr Thr Leu lie 

325 - 330 ' 335 1 

Ser Ala Lys Ser Glu Asp Ser Arg Glu Asn Leu Gin Ala Leu Leu Leu 



340 



345 



350 



Asp Lys lie Lys Glu lie Phe Glii Ala Phe Thr Leu Arg Val Pro Phe 
355 360 365 , 

i 

Ser Lys Ser Tyr Lys lie His Asp Leu Glu Ser Val Ala lie Leu Glu 
370- , 375 380 

Glu Arg Asp Tyr Gin Glu Asp Gly Glu Val lie Thr Gly Tyr lie Ser 
3-85 390 395 400 

r 

Glu Lys Asn Lys Trp Arg Leu Glii Glu Phe Tyr Asp 

405 ! 410 



t ■ 



30 



35 



40 



<210> 151 
<211> 160 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 151 

Met Ala Glu Lys Thr Tyr Pro Met Thr Leu Glu Glu Lys Glu Lys Leu 
15 10 15 

Glu Lys Glu Leu Glu Glu Leu Lys Leu Val Arg Arg Pro Glu Val Val 

20 25 30 

Glu Arg lie Lys lie Ala Arg Ser Tyr Gly Asp Leu Ser Glu Asn Ser 
35 . - 40 45 ' 



45 



Glu Tyr Glu Ala Ala Lys Asp^ Glu Gin Ala Phe Val Glu Gly Gin lie 
50 55 '60 

Ser Ser Leu Glu Thr Lys lie Arg Tyr Ala Glu lie Val Asn Ser Asp 
65 70 75 80 



Ala Val Ala Gin Asp Glu Val Ala lie Gly Lys Thr Val Thr lie Gin 

50 . 85 90 95 

Glu lie Gly Glu Asp Glu Glu Glu Val Tyr lie lie Val Gly Ser Ala 

100 105 110 

55 Gly Ala Asp Ala Phe Ala Gly Lys Val Ser Asn Glu Ser- Pro lie Gly 

115 120 125 
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Gin Ala Leu lie Gly Lys Lys Thr Gly Asp Thr Ala Thr lie Glu Thr 1 

13.0 ',135 ' ' 140 

Pro Val Gly Ser ' Tyr , Asp Val Lys lie lieu Lys Val' Glu Lys Thr Ala 

145 • 150 1 155 i . . ' ' 160 



10 

<210> 152 <' ' ■ , " ■ 

<211> 189 

<212> PRT 1 t j 

/ 1 

<213^ Streptococcus pneumoniae , 

i ' 
<400> 152 

Met Thr Lys Leu Leu Val Gly Leu Gly Asn Pro Gly Asp Lys Tyr Phe 
1 5 10 " 15 

20 Glu Thr Lys His Asn Val Gly Phe Met Leu lie Asp Gin Leu Ala Lys 

20 25 30 

i . 1 



Lys Gin Asn Val Thr Phe Thr His Asp Lys lie Phe Gin Ala Asp Leu 
35. 4*0 . 45 

« E 

Ala Ser Phe Phe Leu Asn Gly Glu Lys lie Tyr Leu Val Lys Pro Thr 
50 55 60 



Thr Phe Met Asn Glu Ser Gly Lys Ala Val His Ala Leu Leu Thr Tyr 

30 65 70 75 80 

Tyr Gly Leu Asp lie Asp Asp Leu Leu lie lie Tyr Asp Asp Leu Asp 

85 90 95 

35 Met Glu Val Gly Lys lie Arg Leu Arg .Ala Lys Gly Ser Ala Gly Gly 

100 105 . 110 



His Asn Gly lie Lys Ser lie lie Gin His lie Gly Thr Gin Val Phe 
115 120 125 

Asn Arg Val Lys lie Gly lie Gly Arg Pro Lys Ash Gly Met Ser Val 
130 135 140 



Val His His Val Leu Ser Lys Phe Asp Arg Asp Glu Tyr lie Gly lie 
45 145 150 155 160 

Leu Gin Ser Val Asp Lys Val Asp Asp Ser Val Asn Tyr Tyr Leu Gin 

165 . 170 175 

50 Glu Lys Asn Phe Glu Lys Thr Met Gin Arg Tyr Asn Gly 

180 185 



<210> 153 . 

55 <211> 283 

<212> PRT 

<213> Streptococcus pneumoniae 
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<400> 153 

Met lie Leu- lie ,Thr Gly Ala Asn Gly Gin, Leu Gly Thr Glu Leu Arg 
1 .51 ( ' 10 " , 15 

5 ' 

Tyr Leu Leu Asp' Glu Arg Asn Glu Glu Tyr Val .Ala Val. Asp Val Ala 

20 25 30 

i 

Lys Met Asp lie Thr Asn Glu. Glu Met Val Glu Lys -Val Phe Glu Glu 
10 35 ■ , ' 40 , ' 45 

■ 1 . 1 

Val Lys Pro Thr Leu Val "Tyr His Cys Ala Ala Tyr Thr Ala Val Asp 

\50 . , , 55 , 1 60 

i 

15 Ala Ala Glu Asp Glu Gly Lys Glu Leu Asp Phe 1 Ala* lie Asn Val Thr 
65 70 75 80 



20 



35 



50 



Gly" Thr Lys Asn yal Ala Lys Ala Ser Glu Lys His Gly Ala Thr Leu 

85 ,90 95 

i 

Val Tyr lie Ser Thr Asp Tyr Val Phe Asp Gly Lys Lys t Pro Val Gly 

' 100 , 105 110 



Gin Glu Trp' Glu Val Asp Asp Arg Pro Asp Pro Gin Thr Glu Tyr Gly 

25 115 120 125 

Arg Thr Lys Arg Met Gly Glu Glu Leu Val Glu Lys His Val Ser Asn 

130 135 140 

* 

30 • Phe Tyr lie lie Arg Thr Ala Trp Val Phe Gly Asn Tyr Gly Lys Asn 

145 150 . 155 160 



Phe Val Phe Thr Met Gin Asn Leu Ala Lys Thr His Lys Thr Leu Thr 

165 - 170 175 

Val' Val Asn Asp Gin Tyr Gly Arg Pro Thr Trp Thr Arg Thr Leu Ala 

180 185 190 



Glu Phe Met Thr Tyr Leu Ala Glu Asn Arg Lys Glu Phe Gly Tyr Tyr 

40 195 200 205 

His Leu Ser Asn Asp Ala Thr Glu Asp Thr Thr Trp Tyr Asp Phe Ala 

210 215 220 

45 Val Glu lie Leu Lys Asp Thr Asp Val Glu Val Lys' Pro Val Asp Ser 

225 230 235 240 



Ser Gin Phe Pro Ala Lys Ala Lys Arg Pro Leu Asn Ser Thr Met Ser 

245 250 255 

Leu Ala Lys Ala Lys Ala Thr Gly Phe Val lie Pro Thr Trp Gin Asp 

260 265 270 



' Ala Leu Gin Glu Phe Tyr Lys Gin Glu Val Arg 
55 275 280 
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<210> 154 
<211> 407 
<212> PRT 

<213> Streptococcus pneumoniae 



10 



<400> 154 

Met Lys Arg Ser Leu Asp Ser Arg Val Asp Tyr Ser Leu Leu Leu Pro 
1 5 10 15 ■ 



15 



Val Phe Phe Leu Leu Val He Gly Val Val Ala He Tyr He Ala Val 

20 ! 25 30 

i 

Ser His Asp Tyr Pro Asn Asn He Leu Pro He Leu *Gly Gin Gin Val 
35 40 45 



20 



Ala Trp He Ala Leu Gly Leu Vai He Gly Phe Val Val Met Leu Phe 

50 '55 60 

Asn Thr Glu Phe Leu Trp Lys Val Thr Pro Phe Leu Tyr lie Leu Gly 

65 70 75 80 



Leu Gly Leu Met He Leu Pro lie* Val Phe Tyr Asn Pro Ser Leu Val 

25 85 90 95 

Ala Ser Thr Gly Ala Lys Asn Trp Val Ser He Asn Gly He Thr Leu 

100 105 110 

30 Phe Gin Pro Ser Glu Phe Met Lys lie Ser Tyr He Leu Met Leu Ala 

115 120 125 



35 



Arg Val lie Val Gin Phe Thr Lys Lys His Lys Glu Trp Arg Arg Thr 

130 135 140 

Val Pro Leu Asp Phe Leu Leu He Phe Trp Met He Leu Phe Thr He 

145 150 155 160 



Pro Val Leu Val Leu Leu Ala Leu Gin Ser Asp Leu Gly Thr Ala Leu 
40 165 170 175 

Val Phe Val Ala He Phe Ser Gly He Val Leu Leu Ser Gly Val Ser 

180 185 190 

45 Trp Lys lie He He Pro Val Phe Val Thr Ala Val Thr Gly Val Ala 

195 200 205 



50 



Gly Phe Leu Ala He Phe He Ser Lys Asp Gly Arg Ala Phe Leu His 
210 215 220 

Gin He Gly Met Pro Thr Tyr Gin He Asn Arg He Leu Ala Trp Leu 
225 230 235 240 



Ash Pro Phe Glu Phe Ala Gin Thr Thr Thr Tyr Gin Gin Ala Gin Gly 
55 245 • 250 255 



Gin He Ala He Gly Ser Gly Gly Leu Phe. Gly Gin Gly Phe Asn Ala 
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260 



265 



270 



10 



Ser Asn Leu Leu lie Pro Val Arg Glu Ser Asp Met lie Phe Thr VaJ. 
275 280 1 285 

lie Ala Glu' Asp Phe Gly Phe lie Gly Ser Val Leu Val lie Ala'Leu- 
290 295 ' 300 

Tyr Leu Met Leu lie Tyr Arg Met Leu Lys lie, Thr Leu Lys Ser Asn 

305 310 ■ 315 1 320 

Asn Gin Phe Tyr Thr Tyr lie Ser Thr Gly Leu lie- Met Met Leu Leu' 



325 



330 



335 



1 , 

15 Phe His lie Phe Glu Asn lie Gly Ala Val Thr Gly Leu Leu Pro Leu 



340 



345 



350 



20 



25 



Thr Gly lie Pro Leu Pro Phe He Ser Gin Gly Gly Ser Ala He He 
355 360 365 

Ser Asn Leu He Gly Val Gly Leu Leu Leu Ser Met Ser Tyr Gin Thr 
370 375 380 

i 

Asn Leu Ala Glu Glu Lys Ser Gly Lys Val Pro Phe Lys Arg Lys Lys 
385 390 395 400 

Val Val Leu Lys Gin He Lys 

* 405 



30 



35 



40 



<210> 155 
<211> 202 
<212> PRT 

<213> Streptococcus pneumoniae' 
<400> 155 * 

Met Gly Lys He He Gly He Thr Gly Gly He Ala Ser Gly Lys Ser 
1-5 10 15 

Thr Val Thr Asn Phe Leu Lys His Gin Gly Leu Ser Ser Ser Gly Leu 

20 25 30 



45 



Pro Thr. Gin Cys Ser Thr Asn Tyr Arg Lys Pro Gly Gly Arg Leu Phe 
35 40 45 

* » 

Glu Ala Leu Val Gin His Phe Gly Gin Glu He He Leu Glu Asn Gly 
50 55 . 60 



Glu Leu Asn Arg Pro Leu He Ala Ser Leu He Phe Ser* Asn Pro Glu 
50 65 70 75 "80 



55 



Glu Gin Lys Trp Ser Asn Gin lie Gin Gly Glu He He Arg Glu Glu 

.85 90 95 

Leu Ala Thr Leu Arg Glu Gin Leu Ala Gin Thr Glu Glu lie Phe Phe 

100 105 110 
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F 

i 

* 

Met Asp lie Pro Leu Leu Phe Glu Gin Asp Tyr Ser Asp Trp Phe Ala 
115 ' t ■ 120 125 

i 

Glu Thr Trp Leu \^al Tyr Val Asp Arg Asp Ala Gin Val Glu Arg Leu' 
> 130 , '135 . ■ • , 140 ' : 



15 



40 



Met Lys Arg Asp Gin Leu Ser Lys Asp Glu Ala Glu" Ser Afg Met Ala 
145 150 155 160 

10 Ala Gin Trp Pro Leu Glu Lys Lys Lys Asp LeU ' Ala Ser Gin Val Leu 

165 ' 170 ■ * . ■ . 175 



Asp Asn Asn Gly Asn Gin Asn Gin JLeu Leu Asn Gin Val His lie Leu 
' 180 185 , 190 , 

i 

Leu Glu Gly Gly Arg Gin Asp Asp Arg Asp 
195 200 



* 1 ' 

20 <210> 156 • 
<211> 419 

<212> PRT f 
<213> Streptococcus pneumoniae 

25 <400> 156 

Met Arg Lys lie Val lie Asn Gly Gly Leu Pro Leu Gin Gly Glu lie 
1 5 10 15 • 

Thr lie Ser Gly Ala Lys Asn Ser Val Val Ala Leu lie Pro Ala He 
30 20 25 30 

He Leu Ala Asp Asp Val Val Thr Leu Asp Cys Val Pro Asp He Ser 
35 40 45 

35 Asp Val Ala Ser Leu Val Qlu He Met Glu Leu Met Gly Ala Thr Val 

50 55 60 . 



Lys Arg Tyr Asp Asp Val Leu Glu He Asp Pro Arg Gly* Val Gin Asn 
65 70 75 80 

He Pro Met Pro Tyr Gly Lys He Asn Ser Leu Arg Ala Ser Tyr Tyr 

85 90 95 



Phe Tyr Gly Ser Leu 

45 ioo 

Pro Gly Gly Cys Asp 
115 

50 Ala Phe Glu Ala Met 
' 130 



Leu Gly Arg Phe Gly Glu 

105 

Leu Gly Pro Arg Pro He 
120 

Gly Ala Thr Ala Ser Tyr 
135 



Ala Thr Val Gly Leu 
110 

Asp Leu His Leu Lys 
125 

Glu Gly Asp Asn Met 
140 



Lys Leu Ser Ala Lys Asp Thr Gly Leu His Gly Ala Ser He Tyr Met 
145 150 155 160 

55 

Asp Thr Val Ser Val Gly Ala Thr He Asn Thr Met He Ala Ala Val 

165 170 175 
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Lys Ala Asn Gly Arg Thr lie He Glu Asn Ala Ala Arg Glu Pro Glu 

!80 185. . 190 

• i ... . . 1 .i - .... 

He He Asp Val Ala ' Thr Leu Leu Asn' Asn Met Gly, Ala His lie Arg 
195 200 . • 205 , ' . ' , 

t 

Gly Ala Gly Thr Asn He He He He Asp Gly Val Glu Arg Leu His 
210 215 220 

Gly Thr Arg His Gin Val lie Pro Asp Arg lie Glii Ala Gly Thr Tyr 
225 230 235 240 

lie Se!r Leu 'Ala Ala Ala Val Gly Lys Gly He Arg He Asn Asn Val 

245 250 255 

Leu Tyr Glu His Leu Glu Gly Phe He Ala Lys Leu Glu Glu Met Gly 

260 265 270 

1 ■ 

i 

Val Arg Met Thr Val Ser Glu Asp Ser He Phe Val Glu Glu Gin Ser 
275 280 285 

• i - ! 

Asn Leu Lys Ala He Asn He Lys Thr Ala Pro Tyr Pro Gly Phe Ala 
230 295 300 

Thr Asp Leu Gin Gin Pro Leu Thr Pro Leu Leu Leu Arg Ala Asn Gly 
305 310 315 320 

Arg Gly Thr He Val Asp Thr lie Tyr Glu Lys Arg Val Asn His Val 

325 330 335 

Phe Glu Leu Ala Lys Met Asp Ala Asp He Ser Thr Thr Asn Gly His 

340 345 350 

He Leu Tyr Thr Gly Gly Arg Asp Leu Arg Gly Ala Ser Val Lys Ala 
355 360 365 

Thr Asp Leu Arg Ala Gly Ala Ala Leu Val He Ala Gly Leu Met Ala 
370 375 380 

Glu Gly Lys Thr Glu He Thr Asn He Glu Phe He Leu Arg Gly Tyr 
385 390 395 400 

Ser Asp He He Glu Lys Leu Arg Asn Leu Gly Ala Asp He Arg Leu 

405 410 415 

Val Glu Asp 



<210> 157 
<211> 231 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 157 

Met Ser Arg He Glu Phe Ser Pro Ser Leu Met Thr Met Asp Leu Asp 
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I 

1 5 1 10 15 

Lys Phe Lys Glu Gin lie Thr Phe Leu Asn Asp Lys Val Ala Ser Tyr 

20 25 , , ■ , 30 



His lie Asp lie Met Asp Gly His Phe Val Pro Asn, lie Thr Leu Ser 
35 ,40 45 ' 



Pro Trp Phe lie Gin Glu Val Gin Lys lie Ser Asp Thr Pro Leu Ser 
10 50 55 60 

Val His Leu Met Val Thr Asp Pro Thr Phe Trp Val Asp Gin Val Leu 



65 70 



75 , ,80 



15 Asp Leu Gin Cys Glu Tyr lie Cys lie His Ala Glu Val Leu Asn Gly 

85 1 90 95 



i ■ 



Leu Ala Phe Arg Leu lie Asp Lys lie His Asp Ala Gly Leu Lys Ala 

100 105 110 

Gly Val Val Leu Asn Pro Glu Thr. Pro Val Ser Thr lie. Phe Pro Tyr 
115 120 125 



lie Asp Leu Leu Asp Lys Val Thr He Met Thr Val Asp Pro Gly Ph'e 
25 130 135 140 

Ala Gly Gin Arg Phe Leu Glu Ser Thr Leu Tyr Lys He Gin Glu Leu 
145 150 155 160 

* 

30 Arg Gin Leu Arg Val Gin Asn Gly Tyr His Tyr He He Glu Met Asp 

165 170 175 



Gly Ser Ser Ser Arg Lys Thr Phe Lys Gin He Asp Val Ala Gly Pro 

180 185 190 

Asp He Tyr Val He Gly Arg Ser Gly Leu Phe Gly Leu Asp Asp Asp 
195 200 205 



lie Ala Lys Ala Trp Asp He Cys Ser Arg Asp Tyr Glu Glu Met Thr 
40 210 215 220 



Gly Lys Thr Met Pro He Lys 
225 230 



<210> 158 
<211> 374 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 158 

Met Arg Asn Met Ala Leu Thr Ala Gly He Val Gly Leu Pro Asn Val 
1 5 10 ' 15 

55 Gly Lys Ser Thr Leu Phe Asn Ala He Thr Lys Ala Gly Ala Glu Ala 

20 25 30 
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Ala Asn Tyr Pro' Phe Ala Thr lie Asp Pro Asn Val Gly Met Val Glu 
35 40 45 

Asp Pro Asp Glu Argi Leu Gin Lys Leu Thr Glu Met lie Thr Pro -Lys , 
5 50 ' 55 ' " 1 60 

< • ) 

Lys Thr Val Pro Thr Thr Phe Glu Phe Thr Asp lie Ala Gly lie Val 
65 70 75 80 

■ * 

10 Lys Gly Ala Ser Lys, Gly.' Glu Gly Leu. Gly Ash Lys Phe Leu Ala Asn 

85 , 9Q , ' • . " ' , 95 

lie Arg Glu, Val Asp Ala lie Val 'His Val Val Arg, Ala Phe Asp Asp 

1 100 105 ' ■ 110 

Glu Asn Val Met Arg Glu Gin Gly Arg Glu Asp. Ala Phe Val Asp Pro 
115 120 ' 125 

Leu Ala Asp lie Asp Thr lie Asn Leu Glu Leu lie Leu Ala Asp Leu 
20. 130 135 * 140 

i ■ ■' 
Glu Ser Val Asn Lys Arg Tyr Ala Arg Val Glu Lys Met Ala Arg Thr 

145 150 155 160 

25 Gin Lys Asp Lys Glu Ser Val Ala Glu Phe Asn Val Leu Gin Lys lie 

165 170 175 

Lys Pro Val Leu Glu Asp Gly Lys Ser Ala Arg Thr lie Glu Phe Thr 

180 185 190 

Asp Glu Glu Gin Lys Val Val Lys Gly Leu Phe Leu Leu Thr Thr Lys 
195 200 205 



30 



Pro Val Leu Tyr Val Ala Asn Val Asp Glu Asp Val Val Ser Glu Pro 
35 210 215 220 

Asp Ser lie Asp Tyr Val Lys Gin lie Arg Glu Phe Ala Ala Thr Glu 
225 230 . 235 240 

40 Asn Ala Glu Val Val Val lie Ser Ala Arg Ala Glu Glu Glu lie Ser 

245 250 255 



45 



Glu Leu Asp Asp Glu Asp Lys Lys Glu Phe Leu Glu Ala lie Gly Leu 

260 265 270 

Thr Glu Ser Gly Val Asp Lys Leu Thr Arg Ala Ala Tyr His Leu Leu 

275 280 285 



Gly Leu Gly Thr Tyr Phe Thr Ala Gly Glu Lys Glu Val Arg Ala Trp 
50 290 295 300 

Thr Phe Lys Arg Gly Met Lys Ala Pro Gin Ala Ala Gly lie lie His 
305 310 315 • 320 

55 Ser Asp Phe Glu Lys Gly Phe lie Arg Ala Val Thr Met Ser Tyr Glu 

325 330 335 
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Asp Leu Val Lys Tyr Gly Ser Glu Lys Ala Val Lys. Glu Ala Gly Arg 

340. 345 350 

Leu Arg Glu Glu Gly Lys Glu Tyr lie Val Gin Asp Gly* Asp lie Met 

355 360 365 

i i 

1 i 

Glu Phe Arg Phe Asn Val , ' 

370 



10 



15 



20 



<210> 159 
<211> 110 
<212> PRT 

<213> Streptococcus pneumoniae 1 

, • ■ ! • 

<400> 159 

Met- Glu lie Glu Lys Thr Asn Arg Met Asn Ala Leu Phe Glu Phe Tyri 
1- 5 : 10 15 

* 

Ala Ala Leu Leu Thr Asp Lys Gin Met Asn Tyr lie Glu Leu Tyr Tyr 

20 25 30 



25 



Ala Asp Asp Tyr Ser Leu Ala Glu lie Ala Glu Glu Phe Gly Val Ser 

.35 ' 40 45 

i j 

Arg Gin Ala Val Tyr Asp Asn lie Lys Arg Thr Glu Lys lie Leu Glu 
50 55 60 



Asp Tyr Glu Met Lys Leu His Met Tyr Ser Asp Tyr lie Val Arg Ser 
30 65 70 75 80 

Gin lie Phe Asp Gin lie Leu Glu Arg Tyr Pro Lys Asp Asp Phe Leu 

85 90 95 

35 Gin Glu Gin lie Glu He Leu Thr Ser He Asp Asn Arg Glu 

• 100 105 110 



<210> 160 
40 <211> 223 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 160 

45 Met Thr Leu Glu Trp Glu Glu Phe Leu Asp Pro Tyr He Gin Ala Val 
15 10 15 



50 



Gly Glu Leu Lys He Lys Leu Arg Gly He Arg Lys Gin Tyr Arg Lys 

20 25 30 

Gin Asn Ly£ His Ser Pro He Glu • Phe Val Thr Gly Arg Val Lys Pro 
35 40 45 



He Glu Ser He Lys Glu Lys Met Ala Arg Arg Gly He Thr Tyr Ala 
55 50 . 55 60 



Thr Leu Glu His Asp Leu Gin Asp lie Ala Gly Leu Arg Val Met Val 



100 
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65 



70 



75 



80 



Gin Phe Val Asp Asp Val Lys Glu Val Val Asp lie Leu His Lys Arg 

i 85 90 ■ . ' 1 95 ' 

5 = " ' , 

Gin Asp Met Arg lie lie Gin . Glu Arg Asp Tyf Ile Ttir His Arg- Lys, 

100 105 110 

Ala Ser Gly Tyr Arg Ser Tyr His Val Val Val Glu Tyr Thr Val Asp 
10 115 120 125 

Thr lie Asn Gly Ala Lys Thr Il:e Leu Ala Glu lie Gin He Arg Thr i 



15 



20 



' 130 



135 



140 



Leu Ala Met Asn Phe Trp Ala Thr lie Glu His Ser Leu Asn Tyr Lys 

145 150 155 160 

Tyr Gin Gly Asp Phe Pro Asp Glu lie Lys Lys Arg Leu Glu lie Thr 

165. 170 175 

Ala Arg lie Ala His Gin Leu Asp Glu Glu Met Gly Glu lie Arg Asp 

180 185 190 



i * 



Asp lie Gin Glu Ala Gin Ala Leu Phe Asp Pro Leu Ser Arg Lys Leu 
25 195 200 205 



30 



35 



Asn Asp Gly Val Gly Asn Ser Asp Asp Thr Asp Glu Glu Tyr Arg 
210 215 220 



<210> 161 
<211> 195 
<212> PRT- 

<213> Streptococcus pneumoniae 
<400> 161 

Met Glu Leu Asn Thr His Asn Ala Glu lie Leu Leu Ser Ala Ala Asn 
15 10 15 



40 Lys Ser His Tyr Pro Gin Asp Glu Leu Pro Glu lie Ala Leu Ala Gly 

20 25 30 



.45 



Arg Ser Asn Val Gly Lys Ser Ser Phe lie Asn Thr Met Leu Asn Arg 
- 35 40 45 

Lys Asn Leu Ala Arg Thr Ser Gly Lys Pro Gly Lys Thr Gin Leu Leu 
50 55 60 



Asn Phe Phe Asn lie Asp Asp' Lys Met Arg Phe Val Asp Val Pro Gly 
50 65 70 75 80 

Tyr Gly Tyr Ala Arg Val Ser Lys Lys Glu Arg Glu Lys Trp Gly Cys 

85 90 95 

55 Met lie Glu Glu Tyr Leu Thr Thr Arg Glu Asn Leu Arg Ala Val Val 

. 100 105 110 
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Ser Leu Val Asp Leu Arg His Asp Pro Ser Ala Asp Asp Val Gin Met « 
115 • 120 125 

Tyr Glu Phe Leu Lys Tyr Tyr Glu lie Pro Val lie, lie Val Ala Thr 

5 130 , ' 135 , ,140 

i ■ 

• ' t 

i ' , ■• , 

Lys Ala Asp Lys lie Pro Arg Gly Lys Trp Asn Lys His Glu Ser Ala 

145 150 155 160 

10 lie Lys Lys Lys Leu Asa Phe Asp Pro Ser Asp Asp Phe lie Leu Phe 

165 ' 170 , . , 175 1 



15 



40 



45 



Ser Ser Val Ser Lys Ala Gly Met , Asp Glu Ala Trp Asp Ala lie Leu 

- 180 , 185 190 

* i 

i 

Glu Lys Leu 
195 



20 <210> 162 
<211> 97 
<212> PRT i 

<213> Streptococcus pneumoniae 
25 <400> 162 

Met Lys Thr Arg Lys lie Pro Leu Arg Lys Ser Val Val Ser Asn Glu 
1 5 .10 15 

Val lie Asp Lys Arg Asp Leu Leu Arg lie Val Lys Asn Lys Glu Gly 
30 20 25 30 

Gin Val Phe lie Asp Pro Thr Gly Lys Ala Asn Gly Arg Gly Ala Tyr 
35 40 45 

35 lie Lys Leu Asp Asn Ala Glu Ala Leu Glu Ala Lys Lys Lys Lys Val 

50 55 60 



Phe Asn Arg Ser Phe Ser Met Glu Val Glu Glu Ser Phe Tyr Asp Glu 
65 70 75 80 

Leu lie Ala Tyr Val Asp His Lys Val Lys Arg Arg Glu Leu Gly Leu 

85 90 95 

Glu 



<210> 163 
<211> 103 
50 <212> PRT 

<213> Streptococcus pneumoniae 

<400&*'163 

Met Leu Lys Pro Ser lie Asp Thr Leu Leu Asp Lys Val Pro Ser Lys 
55 1 5 10 15 

Tyr Ser Leu Val lie Leu Glu Ala Lys Arg Ala His Glu Leu Glu Ala 
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20 , ,25 30 , 

f 

Gly Ala Pro Ala Thr Gin Gly Phe Lys Ser Glu Lys Ser Thr Leu Arg 

35 " < ' , 40 ' 45 ( 

Ala Leu Glu Glu' lie Glu Ser Gly Asn Val Thr lie His Pro Asp' Pro 
50 55 60 

Glu Gly Lys Arg Glu Ala Val Arg Arg Arg lie Glu Glu Glu Lys Arg 

65 ,70 - 75 80 

. )i ■ » 

Arg Lys Glu Glu Glu Glu Lys Lys lie Lys Glu Glh lie Ala Lys Glu 

.85 / ' 1 90 • 95 



15 Lys Glu Asp Gly Glu Lys lie 

100 



<210> 164 

20 <211> 103 

<212> PRT ( " i 

<213> Streptococcus pneumoniae 

<400> 164 

25 Met Ser Leu Thr Ser Lys Gin Arg Ala Phe Leu Asn Ser Gin Ala His 
1 5 10 15 



Thr Leu Lys Pro lie lie Gin lie Gly Lys Asn Gly Leu Asn Asp Gin 

20 25 30 

* 4 * 

t 

lie Lys Thr Ser Val Arg Gin Ala Leu Asp Ala Arg Glu Leu lie Lys 
35 40 45 



Val Thr Leu Leu Gin Asn Thr Asp Glu Asn lie His Glu Val Ala Glu 
35 50 55 . 60 

lie Leu Glu Glu Glu lie Gly Val Asp Thr Val Gin Lys lie Gly Arg 
65 70 75 80 

40 lie Leu lie Leu Phe Lys Gin Ser Ser Lys Lys Glu Asn Arg Lys lie 

85 90 95 



Ser Lys Lys Val Lys Glu lie 

100 



<210> 165 

<211> 175 

<212> PRT 

50 <213> Streptococcus pneumoniae 



<400> 165 

Met Ala lie Glu Asn Tyr lie Pro Asp Phe Ala Val Glu Ala Val Tyr 
1 5 10 15 

Asp Leu Thr Val Pro Ser Leu Gin Ala Gin Gly lie Lys Ala Val Leu 

20 25 30 
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10 



25 



50 



Val Asp Leu Asp Asn Thr Leu lie Ala Trp Asn Asn Pro Asp Gly Thr 
35 40 45 

Pro Glu Met Lys Gin Trp Leu His Asp Leu Arg Asp Ala Gly lie 6ly 

50 55, 1 60 • ' . , 

lie lie Val Val Ser Asn Asn Thr Lys Lys .Arg Val Gin Arg Ala Val 

65 70 75 80 

Glu Lys Phe Gly lie Asp Tyr Val Tyr Trp Ala Leu Lys Pro. Phe Thr 

85 ! 90 95 



Phe Gly lie Asp Arg Ala Met Lys Glu Phe His Tyr Asp Lys Lys Glu 
15 , " ■ - 100 I 105 110 

Val Val Met Val Gly Asp Gin Leu Met Thr Asp lie Arg Ala Ala His! 
115 ' 120 125 

20 Arg Ala Gly lie Arg Ser lie Leu Val Lys Pro Leu Val Gin His Asp 

130 135 140 



Ser lie Lys Thr Gin lie Asn Arg Thr Arg Glu Arg* Arg Val Met Arg 

145 150 ! ( 155 160 

Lys lie Thr Glu Lys Tyr Gly Pro lie Thr Tyr Lys Lys Gly lie 

165 170 175 



30 <"210> 166 
<211> 455 
<212> PRT 

<213> Streptococcus pneumoniae 
35 <400> 166 

Met Phe Arg Lys lie Leu lie Ala Asn Arg Gly Glu lie Ala Val Arg 
15 10 15 

lie lie Arg Ala Ala Arg Glu Leu Gly lie Ala Thr Val Ala Val Tyr 
40 20 25 30 

Ser Thr Ala Asp Lys Glu Ala Leu His Thr Leu Leu Ala Asp Glu Ala 
*35 * 40 45 

45 Val Cys lie Gly Pro Gly Lys Ala Thr Glu Ser Tyr Leu Asn lie Asn 

50 55 60 



Ala Val Leu Ser Ala Ala Val Leu Thr Glu Ala Glu Ala lie His Pro 

65 70 75 80 

Gly' Phe Gly Phe Leu Ser Glu Asn Ser Lys Phe Ala Thr Met Cys Glu 

85 90 95 



Glu lie - Gly He Lys Phe He Gly Pro Ser Gly His Val Met Asp Met 
55 100 105 110 

Met Gly Asp Lys He Asn Ala Arg Ala Gin Met He Lys Ala Gly Val 
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! 

115 120 125 

Pro Val He Pro Gly Ser Asp Gly Glu Val His Asn Ser Glu Glu Ala 

130 135 ' . ' 140:' " *' ' , 

Leu He Val Ala Glu Lys He Gly Tyr Pro Val Met' Leu Lys Ala Ser ' 
145 150 155 160 



Ala Gly Gly Gly Gly Lys Gly lie Arg Lys Val Glu Lys Pro Asp Asp 
10 165 170 ' - 175 ' 

Leu Val Ser Ala Phe Glu Thr Ala Ser Ser Glu Ala Lys Ala Asn Tyr 

180 j 185 190 ,■ 1 

15 Gly Asn Gly Ala Met Tyr He Glu Arg Val He Tyr Pro Ala Arg His 

195 200 205 



lie Glu Val Gin He Leu Gly Asp Glu His Gly His Val lie His Leu 

210 215 220 

Gly Glu Arg Asp Cys Ser Leu Gin Arg Asn Asn Gin Lys Val Leu Glu 

225 230 1 235 240 



Glu Ser Pro Ser lie Ala He Gly Lys Thr Leu Arg His Glu He Gly 

25 245 250 255 

' Ala Ala Ala Val Arg Ala Ala Glu Phe Val Gly Tyr Glu Asn Ala Gly 

260 265 270 

30 Thr He Glu Phe Leu Leu Asp Glu Ala Ser Ser Asn Phe Tyr Phe Met 

275 280 . 285 



Glu Met Asn Thr Arg Val Gin Val Glu His Pro Val Thr Glu Phe Val 

290 295 300 

Ser Gly Val Asp He Val Lys Glu Gin lie Cys lie Ala Ala Gly Gin 
305 310 315 320 



Pro Leu Ser Val Lys Gin Glu Asp lie Val Leu Arg Gly His Ala lie 

40 325 330 335 

Glu Cys Arg lie Asn Ala Glu Asn Pro Ala Phe Asn Phe Ala Pro Ser 

340 345 350 

45 Pro Gly Lys lie Thr Asn Leu Tyr Leu Pro Ser Gly Gly Val Gly Leu 

3'55 360 365 



Arg Val Asp Ser Ala Val Tyr Pro Gly Tyr Thr He Pro Pro Tyr Tyr 
370 375 380 

Asp Ser Met lie Ala Lys lie lie Val His Gly Glu Asn Arg Phe Asp 

385 390 395 400 



Ala Leu Met Lys Met Gin Arg Ala Leu Tyr Glu Leu Glu lie Glu Gly 
55 405 410 '415 

Val 'Gin Thr Asn Ala Asp Phe Gin Leu Asp Leu He Ser Asp Arg Asn 
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1 

420' 425 430 

Val lie Ala Gly Asp Tyr Asp Thr Cys Phe Leu Met Glu Thr Phe Leu 

435 i 440 ' ■' ' ' " 445 



i • 



Pro Lys Tyr Gin Glu Lys Glu 
450 455 



10 <210> 167 

<211> 77 ' , , ' 

<212> PRT 

i ■ 1 

<213> Streptococcus pneumoniae ! , < i 

15 <400> 167 , 

Met lie Tyr Lys Val Phe Tyr Gin Glu Thr Lys , Glu Arg Ser Pro Arg 
1 5 10 - 15 

Arg Glu Thr Thr Arg Ala Leu Tyr- Leu Asp lie Asp Thr Ser Ser Glu 
20 20 25 30 

Leu Glu Gl ! y Arg lie Thr Ala Arg Gin Leu Val Glu Glu Asn Arg Pro 
35 ' 40 45 

25 Glu Tyr Asn lie Glu Tyr lie Glu Leu Leu Ser Asp Lys Leu Leu Asp 

50 55 60 



Tyr Glu Lys Glu Thr Gly Ala Phe Glu lie Thr Glu Phe 
65 70 75 



<210> 168 
<211> 336 
<212> PRT 
35 <213> Streptococcus pneumoniae 



<400> 168 

Met Lys Asp Arg Tyr lie Leu Ala Phe Glu Thr Ser Cys Asp Glu Thr 
1 .'5 . 10 15 

Ser Val Ala Val Leu Lys Asn Asp Asp Glu Leu Leu Ser Asn Val lie 

20 25 30 



Ala Ser Gin lie Glu- Ser His Lys Arg Phe Gly Gly Val Val Pro Glu 
45 35 40 45 

Val Ala Ser Arg His His Val Glu Val He Thr Ala Cys He Glu Glu 
50 55 60 

50 Ala Leu Ala Glu Ala Gly He Thr Glu Glu Asp Val Thr Ala Val Ala 
65 70 75 80 



Val Thr Tyr Gly Pro Gly Leu Val Gly Ala Leu Leu Val Gly Leu Ser 

85 90 95 

Ala Ala Lys Ala Phe Ala Trp Ala His Gly Leu Pro Leu He Pro Val 

100 105 110 
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Asn His Met Ala Gly His Leu Met Ala Ala Gin Ser Val Glu Pro Leu 
115 120 125 

Glu Phe Pro Leu Leu Ala Leu Leu Vpil Ser Gly Gly His Thr Glu Leu 
130 135. ' 140' • , 



10 



15 



Val Tyr Val Ser Glu Ala Gly Asp Tyr Lys lie Val Gly Glu Thr Arg 
145 150 1$5 y 160 

i ■ it 

Asp Asp Ala Val Gly Glu Ala Tyr Asp Lys Val Gly Arg Val Met Gly 



165 



170 



175 



Leu Thr Tyr Pro Ala Gly Arg Glu lie Asp Glu Leu Ala His Gin Gly 

i ' 180 ! 185 190 * 



Gin Asp lie Tyr Asp Phe Pro Arg Ala Met lie Lys Glu Asp Asn Leu 1 
195 200 205 

20 Glu Phe Ser Phe Ser Gly Leu Lys Ser Ala Phe lie Asn Leu His His 
210 215 220 



25 



Asn Ala Glu Gin Lys Gly Glu Ser Leu Ser Thr Glu Asp Leu Cys Ala 

225 230 j , . 235 240 

Ser Phe Gin Ala Ala Val Met Asp lie "Leu Met Ala Lys Thr Lys Lys 

245 250 255 



Ala Leu Glu Glu Tyr Pro Val Lys Thr Leu Phe Val Ala Gly Gly Val 
30 260 265 270 

* 

Ala Ala Asn Lys Gly Leu Arg Glu Arg Leu Ala Ala Glu lie Thr Asp 
275 280 285 

35 Val Lys Val lie lie Pro Pro Leu Arg Leu Cys Gly Asp Asn Ala Gly 
■290 295 300 



40 



Met lie Ala Tyr Ala Ser Val Ser Glu Trp Asn Lys Glu Asn Phe Ala 
305 310 315 320 

Gly Trp Asp Leu Asn. Ala Lys Pro Ser Leu Ala Phe Asp Thr Met Glu 

325 330 335 



45 



<210> 169 
<211> 602 
50 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 169 

Met Cys Gly lie Val Gly Val Val Gly Asn Thr Asn Ala Thr Asp lie * 
55. 1 5 . 10' ^ 15 

Leu lie Gin Gly Leu Glu Lys Leu Glu Tyr Arg Gly Tyr Asp Ser Ala 
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20' 2,5 30 , ' 

Gly lie Phe Val Leu Asp Gly Ala Asp Asn His Leu Val Lys Ala Val 

35 ' ! , 40 11 1 45 



11 i 



Gly Arg He Ala Glu Leu Ser Ala Lys Thr Ala Gly, Val Glu Gl^ ,Thr 
50 ( 55 1 60 

Thr Gly He Gly His Thr Arg Trp Ala Thr His Gly, Lys Pro Thr Glu 

65 70 1 | 75 1 ,80 

Asp Asn Ala His Pro His Arg Ser Glu Thr Glu Arg Phe Val Leu Val 

85 ■ t 90 ' 95 

i > ■ 

His Asn Gly Val lie Glu Asn Tyr Leu Glu lie. Lys Glu Glu Tyr Leu 

100 105 110 

i 

Ala Gly His His Phe Lys Gly Gin Thr Asp Thr Glu He Ala Val His 
115 120 ' 12,5 

i 

Leu He Gly Lys Phe Ala Glu Glu Glu Gly Leu Ser Val Leu Glu Ala 
130 1 135 140 

Phe Lys Lys Ala Leu His lie He Arg Gly Ser Tyr Ala Phe Ala Leu 
145 150 155 160 

He Asp Ser Glu Asn Pro Asp Val He Tyr Val Ala Lys Asn Lys Ser 

165 - 170 175 

Pro Leu Leu He Gly Leu Gly Glu Gly Tyr Asn Met Val Cys Ser Asp 

180 185 190 

Ala Met Ala Met He Arg Glu Thr Asn Gin Tyr Met Glu He His Asp 
• 195 200 205 

Gin Glu Leu Val He Val Lys Ala Asp" Ser Val Glu Val Gin Asp Tyr 
210 215 220 

Asp Gly Asn Ser Arg Glu Arg Ala Ser Tyr Thr Ala Glu Leu Asp Leu 
225 230 235 240 

Ser Asp He Gly Lys Gly Thr Tyr Pro Tyr Tyr Met Leu Lys Glu He 

245 250 ■ 255 

Asp Glu Gin Pro Thr Val Met Arg Lys Leu lie Gin Ala Tyr Thr Asp 

260 265 270 

Asp Ala Gly Gin Val Val Val Ala Pro Ala lie lie Lys Ala Val Gin 
275 280 285 

Asp Ala Asp Arg He Tyr He Leu Ala Ala Gly Thr Ser Tyr His Ala 
290 295 300 

Gly Phe Ala Ser Lys Lys Met Leu Glu Glu Leu Thr Asp Thr Pro Val 
305 310 315 320 

Glu Leu Gly lie Ser Ser Glu Trp Gly Tyr Gly Met Pro Leu Leu Ser 



108 



WO 01/49721 PCT/US00/35604 

325 . 330 3.35. 

Lys Lys Pro Leu Phe lie Phe lie Ser Gin Ser Gly Glu Thr Ala Asp 

340 ' , 345 > ' ' 350 

■ s • 1 • ■ . . ;• 

Ser Arg Gin Val' Leu Val Lys Ala Asn Glu Met- Gly, Ile ( Pro Ser 1 Leu ■ 
355 i 360 * 365 ' 

Thr Val Thr Asn Val Pro Gly Ser Thr Leu Ser Arg , Glu Ala Asn Tyr 
10 370 375 ( 380 

Thr Met Leu Leu His Ala Gly Pro Glu lie Ala Val Ala Ser. Thr Lys 
385 390 1 , ■ i 395 400 

15 Ala Tyr Thr Ala G ( ln lie Ala Ala Leu Ala Phe- Leu Ala Lys Ala Val 

405 410 415 



20 



35 



50 



Gly Glu Ala Asn Gly Asn Ala Lys Ala Gin Ala Phe Asp Leu Val His 

420 425 ' 430 

Glu Leu Ser lie Val Ala Gin Ser lie Glu Ser Thr Leu Ser Glu Lys 
43'5 440 445 



Glu Thr lie Glu Ala Lys Val Arg Glu Leu Leu Glu Thr Thr Arg Asn 
25 450 455 460 

* 

Ala Phe Tyr He Gly Arg Gly Gin Asp Tyr Tyr Val Ala Met Glu Ala 
465 470 475 480 

30 Ser Leu Lys Leu Lys Glu He Ser' Tyr He Gin Cys Glu Gly Phe Ala 

485 ' 490 495 



Ala Gly Glu Leu Lys His Gly Thr He Ala Leu He Glu Glu Gly Thr 

500 505 510 

Pro Val Leu Ala Leu Leu Ser Asp Pro Val Leu Ala Asn His Thr Arg . 

515 520 525 



Gly Asn He Gin Glu Val Ala Ala Arg Gly Ala Lys Val Leu Thr He 
40 530 535 540 

Ala Glu Glu Asn Val Ala Lys Asp Thr Asp Asp He Val Leu Thr Thr 

545 550 ' 555 560 

45 Val His Pro Tyr Leu Ser Pro He Ser Met Val Val Pro Thr Gin Leu 

565 570 575 



Val Ala Tyr Phe Ala Thr Leu His Arg Gly Leu Asp Val Asp Lys Pro 

580 58.5 590 

Arg Asn Leu Ala Lys Ser Val Thr Val Glu 
595 600 



55 <210> 170 
<211> 240 
<212> PRT 
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<213> Streptococcus pneumoniae 
<400> 170 

, Met lie Arg lie Glu Asn Leu Ser Val Ser Tyr Lys Glu Thr Leu Ala , ' , 
5 1 5 10 15 ' , ■ 

Leu Lys Asp lie Ser Leu Val Leu His Gly Pro Thr lie Thr Gly lie 

20 25 30 

10 lie Gly Pro Asn Gly Ala Gly Lys Ser Thr Leu Leu Lys Gly Met Leu 

35 40 45 



15 



Gly lie lie Pro His Gin Gly Gin Ala Phe Leu Asp Asp Lys Glu Val 

50 55 1 60 

i 

i + p 

t ■ 

Lys Lys Ser Leu His Arg lie Ala Tyr Val Glu Gin Lys lie Asn lie 

65 70 i 75 8CI 1 



Asp Tyr Asn Phe Pro lie Lys Val Lys Glu Cys Val Ser Leu Gly Leu 
20 85 90 95 

Phe Pro Ser lie Pro Leu Phe Arg Ser Leu Lys Ala Lys His Trp Lys 

100 , 105 - 110 

i 

25 Lys Val Gin Glu Ala Leu Glu lie Val Gly Leu Ala Asp Tyr Ala Glu 

115 120 125 



30 



Arg Gin lie Ser Gin Leu Ser Gly Gly Gin Phe Gin Arg Val Leu lie 

130 135 140 

Ala Arg Cys Leu Val Gin Glu Ala Asp Tyr lie Leu Leu Asp Glu Pro 

145 150 155 160 



Phe Ala Gly "lie Asp Ser Val Ser Glu Glu He He Met Asn Thr Leu 
35 165 170 175 

Arg Asp Leu Lys Lys Ala Gly Lys Thr Val Leu He Val His His Asp 

180 185 190 

40 Leu Ser Lys He Pro His Tyr Phe Asp Gin Val Leu Leu Val Asn Arg 

195 200 205 



45 



Glu Val He Ala Phe Gly Pro Thr Lys Glu Thr Phe Thr Glu Thr Asn 

210. 215 220 

Leu Lys Glu Ala Tyr Gly Asn Gin Leu Phe Phe Asn Gly Gly Asp Leu 

225 230 235 240 



50 



<210> 171 
<211> 740 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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<400> 171 ■ , 

Met Pro Lys Glu Val Asn Leu Thr Gly Glu Glu Val Val Ala Leu Thr 
1 5 10 15 



Lys Glu Tyr Leu Thr Glu Glu Asp Val His Phe Val His Lys Ala Leu 

20 25 30 ■ , 



10 



15 



Val Tyr Ala Val Glu Cys His Ser Gly Gin Tyr Arg Lys Ser Gly Glu. 
35 ■ 40 45 

r 

Pro Tyr lie lie His Pro lie Gin Val Ala Gly lie Leu Ala, Lys Leu 



50 



55 



60 



Lys Leu Asp Ala Val Thr Val Ala Cys Gly Phe Leu His Asp Val Val 
65 70 i 75 80 



Glu Asp Thr Asp Ala Thr Leu Asp Asp Leu Glu Arg Glu Phe Gly Pro ' 

85 90 95 

20 Asp Val Arg Val lie Val Asp Gly Val Thr Lys Leu Gly Lys Val Glu 

100 : 105 110 



25 



Tyr Lys Ser lie Glu Glu Gin Leu Ala Glu Asn His Arg Lys Met Leu 
115 1 120 , 125 

Met Ala Met Ser Glu Asp- lie Arg Val lie- Leu Val Lys Leu Ser Asp 
130 135 140 



Arg Leu His Asn Met Arg Thr Leu Lys His Leu Arg Lys Asp Lys Gin 

30 145 150 155 160 

Glu Arg lie Ser Lys Glu Thr Met Glu lie Tyr Ala Pro Leu Ala His 

165 170 175 

35 Arg Leu Gly He Ser Ser Val Lys Trp Glu Leu Glu Asp Leu Ser Phe 

180 185 190 



40 



Arg Tyr Leu Asn Pro Thr Glu Phe Tyr Lys He Thr His Met Met Lys 
■195 200 205 

Glu Lys Arg Arg Glu Arg Glu Ala Leu Val Asp Glu Val Val Thr Lys 
210 215 220 



Leu Glu Glu Tyr Thr Thr Glu Arg His Leu Lys Gly Lys lie Tyr Gly 
45 225 230 235 240 

Arg Pro Lys His He Tyr Ser He Phe Arg Lys Met* Gin Asp Lys Arg 

245 250 255 

50 Lys Arg Phe Glu Glu He Tyr Asp Leu He Ala lie Arg Cys He Leu 

260 265 270 



55 



Asp Thr Gin Ser Asp Val Tyr Ala Met Leu Gly Tyr Val His Glu Phe 
275 280 285 

Trp Lys Pro Met Pro Gly Arg Phe Lys Asp Tyr He Ala Asn Arg Lys 
290 295 300 



111 



JNSDOC1D: <WO 0149721A2J_> 



WO 01/49721 PCT/US00/35604 



10 



25 



40 



55 



Ala Asn Gly Tyr Gin Ser lie His Thr Thr Val Tyr Gly Pro Iiys' Gly 
305 310' 315 320 

1 

Pro lie Glu Phe Gin' lie Arg Thr Lys. Glu Met His Glu Val Ala Glu ' 

325 330 ' 335 

1 ■ i . . 

i 1 

Tyr Gly Val Ala Ala His Trp Ala Tyr Lys Lys Gly He Lys Gly Gin 

340 345 350 

i 

i * t 

Val Asn Ser Lys Glu' Ser Ala He Gly' Met Asn Trp He Lys, Glu Met 
355 1 360 ' 365 '. 



■I , • 

Met Glu Leu 'Gin Asp Gin Ala Asp Asp Ala Lys .Glu Phe Val Asp Ser 

15 370 , ' ■ 375 ' , 380 

Val Lys Glu Asn Tyr Leu Ala Glu Glu He ,Tyr Val Phe Thr Pro Asp 

385 390 395 400 

i 

20 Gly Ala. Val Arg Ser Leu Pro Lys Asp Ser Gly Pro lie Asp Phe Ala 

405 410 415 

Tyr Glu He His Thr Lys Val Gly Glu Lys Ala Thr Gly Ala Lys Val 

420 425 430 



Asn Gly Arg Met Val Pro Leu Thr Thr Lys Leu Lys Thr Gly Asp Gin 
435 440 445 



Val Glu He He Ala Asn Pro Asn Ser Phe Gly Pro Ser Arg Asp Trp 
30 450 455 460 

Leu Asn Met Val Lys Thr Ser Lys Ala Arg Asn Lys He Arg Gin Phe 
465 470 475 480 

35 Phe Lys Asn Gin Asp Lys Glu Leu Ser Val Asn Lys Gly Arg Glu Met 

485 490 495 



Leu Met Ala Gin Phe Gin Glu Asn Gly Tyr Val Ala Asn Lys Phe Met 

500 505 510 

Asp Lys Arg His Met Asp Gin Val Leu Gin Lys Thr Ser Tyr Lys Thr 
515 520 525 



Glu Asp Ser Leu Phe Ala Ala He Gly Phe Gly Glu He Gly Ala He 
45 530 535 540 

Thr Val Phe Asn Arg Leu Thr Glu Lys Glu Arg Arg Glu Glu Glu Arg 
545 550 555 560 

50 Ala Lys Ala Lys Ala Glu Ala Glu Glu Leu Val Lys Gly Gly Glu Val 

565 570 575 



Lys Val Glu Asn Lys Glu Thr Leu Lys Val Lys His Glu Gly Gly Val 

580 585 590 

Val He Glu Gly Ala Ser Gly Leu Leu Val Arg He Ala Lys Cys Cys 
595 600 605 
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10 



25 



50 



Asn Pro Val Pro Gly Asp .Asp lie Val Gly Tyr lie Thr Lys Gly Arg 

610 615 - 620 

' i ■ , . . . '■ .( ... 

■ i . ' , . 

Gly Val Ala lie Hi's Arg Val Asp Cys Met Asn Leu Arg Ala Gin Glu 
625 " 630 , 635 / . ' 640 

Asn Tyr Glu Gin Arg Leu Leu Asp Val Glu Trp Glu Asp Gin Tyr Ser 

645 650 , 655 



Ser Ser Asn Lys Glu Tyr Leu Ala His lie Asp. lie Tyr Gly 1 ,Leu Asn 

660 665 670 

Arg Thr Gly Leu Leu Asn Asp Val Leu Gin Val Leu Ser Asn Thr Thr 
15 675 i 680 685 

Lys Asn lie Ser Thr Val Asn Ala Gin Pro Thr Lys Asp Met Lys Phe 
690 695 ( , 700 

20 Ala Asn lie His Val Ser Phe Gly lie Ala Asn Leu Ser Thr Leu Thr 
705 710 715 720 



Thr Val Val Asp Lys He Lys Ser Val Pro Glu Val Tyr Ser Val Lys 

725 730 735 

Arg Thr Asn Gly 

740 



30 <210> 172 
<211> 492 
<212> PRT 

<213> Streptococcus pneumoniae 
35 <400> 172 

Met Ser Asn .Trp Asp Thr Lys Phe Leu Lys Lys Gly Phe Thr Phe Asp 
1 5 .10 15 

Asp Val Leu Leu He Pro Ala Glu Ser His Val Leu Pro Asn Asp Ala 
40 20 25 30 

Asp Leu Thr Thr Lys Leu Ala Asp Asn Leu Thr Leu Asn He Pro He 
35 40 45 

45 He Thr Ala Ala Met Asp Thr Val Thr Glu Ser Gin Met Ala He Ala 

50 55 60 



lie Ala Arg Ala Gly Gly Leu Gly Val He His Lys Asn Met Ser He 

* 65 70 75 80 

Ala Gin Gin Ala Asp Glu Val- Arg Lys Val Lys Arg Ser Glu Asn Gly 

85 90 95 



Val He He Asp Pro Phe Phe Leu Thr Pro Glu His Thr He Ala Glu 
55 100 105 110 

Ala Asp Glu Leu Met Gly Arg Tyr Arg He Ser Gly Val Pro Val Val 
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! 

I 

115 120 125 

Glu Thr Leu Glu Asn Arg Lys Leu Val Gly lie Leu Thr Asn Arg Asp 

; 130 135 , 140 . 



Leu Arg Phe lie Ser Asp Tyr Asn Gin Pro lie Ser, Asn His Met, Thr 
145 150 ' 155 1 ' 166 

Ser Glu Asn Leu Val Thr Ala Pro Val Gly Thr Asp Leu Ala Thr Ala 

165 170 f 175 

i 1 . ' 

« f 

Glu Ser lie Leu Gin Glu His Arg lie .Glu Lys Leu Pro Leu Val Asp 

180 I 185 190 

i • 

Glu Glu Gly Ser Leu Ser Gly Leu lie Thr lie Lys Asp lie Glu Lys 
195 200 205 

Val lie Glu Phe Pro Asn Ala Ala Lys Asp Glu Phe Gly Arg Leu Leu 
210 215 • 220 

Val Ala Gly Ala Val Gly Val Thr Ser Asp Thr Phe Glu Arg Ala Glu 
225 230 , 235 240 

Ala Leu Phe Glu Ala Gly Ala Asp Ala lie Val lie Asp Thr Ala His 

245 i 1 250 255 

Gly His Ser Ala Gly Val Leu Arg Lys lie Ala Glu lie Arg Ala His 

260 265 270. 

■ * 

Phe Pro Asp Arg Thr Leu lie Ala Gly Asn lie Ala Thr Ala Glu Gly 
275 280 285 

Ala Arg Ala Leu Tyr Glu Ala Gly Val Asp Val Val Lys Val Gly lie 
290 295 300 

Giy Pro Gly Ser lie Cys Thr Thr Arg Val lie Ala Gly Val Gly Val 
305 310 315 * 320 

Pro Gin Val Thr Ala lie Tyr Asp Ala Ala Ala Val Ala Arg Glu Tyr 

325 330 335 

Gly Lys Thr lie lie Ala Asp Gly Gly lie Lys Tyr Ser Gly Asp He 

340 345 350 

Val Lys Ala Leu Ala Ala Gly Gly Asn Ala Val Met Leu Gly Ser Met 
355 360 365 

Phe Ala Gly Thr Asp Glu Ala Pro Gly Glu Thr Glu He Phe Gin Gly 
370 375 380 

Arg Lys Phe Lys Thr Tyr Arg Gly Met Gly Ser He Ala Ala Met Lys 
385 390 395 400 

Lys Gly Ser Ser Asp Arg Tyr Phe Gin Gly Ser Val Asn Glu Ala Asn 

405 410 415 

Lys Leu Val Pro Glu Gly He Glu Gly Arg Val Ala Tyr Lys Gly Ala 
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420 , ,' 425 430 . 

Ala Ala Asp lie Val Phe Gin Met lie ,Gly Gly He- Arg Ser Gly Met 
435 , 1 . 440 • 445 

Gly Tyr Cys Gly Ala Ala Asn Leu Lys Glu Leu His Asp Asn Ala 'Gin 
450 1 455 460 



Phe lie Glu Met Ser Gly, Ala Gly Leu Lys Glu Ser 1 His Pro His Asp 
10 465 ' 470 1 475 , , 480 



Val Gin lie Thr Asn Glu Ala Pro, Asn Tyr Ser Met 
- , , » ,485 ' 1 490 



<210> 173 " t ' 

<211> 648 
<212> PRT 

<213> Streptococcus pneumoniae 

<4 00> 173 ( ' 
Met Thr Glu Glu lie Lys Asn Leu Gin Ala Gin Asp Tyr Asp Ala Ser 
1 5 10 15 



25 Gin lie Gin Val Leu Glu Gly Leu Glu Ala Val Arg Met Arg Pro Gly 

20 25 30 



Met Tyr He Gly Ser Thr Ser Lys Glu Gly Leu His His Leu Val Trp 
35 40 45 

Glu He Val Asp Asn Ser He Asp Glu Ala Leu Ala Gly Phe Ala Ser 
50 .55 60 



His He Gin Val Phe He Glu Pro Asp Asp Ser He Thr Val Val Asp 
35 65 70 75 80 

Asp Gly Arg Gly He Pro Val Asp He Gin Glu Lys Thr Gly Arg Pro 

85 90 95 

40 Ala Val Glu Thr Val Phe Thr Val Leu His Ala Gly Gly Lys Phe Gly 

100 105 110 



Gly Gly Gly Tyr Lys Val Ser Gly Gly Leu His Gly Val Gly Ser Ser 

115 120 125 

Val Val Asn Ala Leu Ser Thr Gin Leu Asp Val His Val His Lys Asn 
130 135 140 



Gly Lys He His Tyr Gin Glu Tyr Arg Arg Gly His Val Val Ala Asp 

50 145 150 155 160 

Leu Glu lie Val Gly Asp Thr Asp Lys Thr Gly Thr Thr Val His Phe 

165 170 17*5 

55 Thr Pro Asp Pro Lys lie Phe Thr Glu Thr Thr lie Phe Asp Phe Asp 

180 185 190' 
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Lys Leu Asn Lys Arg lie Gin Glu Leu Ala Phe Leu Ash Arg Gly Leu 
195 200 205 

Gin lie Ser lie. Thr Asp Lys Arg Gin Gly Leu Glu Gin Thr Lys His . 



5 210 215 220 

i j 

. i H 

Tyr His Tyr Glu Gly Gly lie Ala Ser Tyr Val Glu Tyr lie Asn Glu 
225 230 235 240 

10 ■ Asn Lys Asp Val lie Phe Asp Thr Pro lie Tyr Thr Asp Gly Glu Met 

245 250 255 



Asp Asp lie Thr, Val Glu Val Al'a Met Gin Tyr Thr Thr Gly Tyr' His 

260 • 265 270 

Glu Asn Val Met Ser Phe Ala Asn Asn lie His Thr His Glu Gly Gly 

275 280 285 1 



Thr His Glu Gin Gly Phe Arg Thr Ala Leu Thr Arg Val lie Asn Asp 
20 290 295 300 

Tyr Ala Arg Lys Asn Lys Leu Leu Lys Asp Asn Glu Asp Asn Leu Thr 
305 310 315 ' 320 

j 

25 Gly Glu Asp Val Arg Glu Gly Leu Thr Ala Val lie Ser Val Lys His 

325 *330 335 



Pro Asn Pro Gin Phe Glu Gly Gin Thr Lys Thr Lys Leu 'Gly Asn Ser 

340 345 350 

Glu Val Val Lys lie Thr Asn Arg Leu Phe Ser Glu Ala Phe Ser Asp 
355 360 ' 365 



Phe Leu Met Glu Asn Pro Gin lie Ala Lys Arg lie Val Glu Lys Gly 
35 370 375 380 

lie. Leu Ala Ala Lys Ala Arg Val Ala Ala Lys Arg Ala Arg Glu Val 
385 . 390 395 400 

40 Thr Arg Lys Lys Ser Gly Leu Glu lie Ser Asn Leu Pro Gly Lys Leu 

405 410 415 



Ala Asp Cys Ser Ser Asn Asn Pro Ala Glu Thr Glu Leu Phe lie Val 

420 425 430 

Glu Gly Asp Ser Ala Gly Gly Ser Ala Lys Ser Gly Arg Asn Arg Glu 
435 440 445 



Phe Gin Ala lie Leu Pro lie Arg Gly Lys lie Leu Asn Val Glu Lys 
50 450 455 460 

* 

Ala Ser Met Asp Lys lie Leu Ala Asn Glu Glu lie Arg Ser Leu Phe 
465 470 475 480 

55 Thr Ala Met Gly Thr Gly Phe Gly Ala Glu Phe Asp Val Ser Lys Ala 

485 490 * 495 
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Arg Tyr Gin Lys Leu Val Leu Met Thr Asp Ala Asp Val Asp Gly Ala 

500 505 510 

His lie Arg Thr Leu Leu Leu Thr Leu lie Tyf Arg Tyr Met Lys Pro 

5 515 520 525 ' • 

i , • 

r 1 

lie Leu Glu Ala Gly Tyr Val Tyf lie Ala Gin Pro Pro lie Tyr. Gly 
530 535 540 

j 

10 Val Lys Val Gly Ser Glu lie Lys Glu Tyr lie Gin Pro Gly Ala Asp 
545 550 555 . 560 



15 



30 



40 



55 



Gin Glu lie Lys Leu Gin Glu Ala Leu Ala Arg Tyr Ser Glu Gly Arg 

565 570 575 

, i 

Thr Lys Pro Thr lie Gin Arg Tyr Lys Gly Leu Gly Glu Met Asp Asp 

580 ; 585 590 



His Gin Leu Trp Glu Thr Thr Met Asp Pro Glu His Arg Leu Met Ala 
20 595 600 605 

Arg Val Ser Val Asp Asp Ala Ala Glu Ala Asp Lys lie Phe Asp Met 
610 615 620 

25 Leu Met Gly Asp Arg Val Glu Pro Arg Arg Glu Phe lie Glu Glu Asn 
625 630 635 640 



Ala Val Tyr Ser Thr Leu Asp Val 

645 - 



<210>" 174 
<211> 88 
<212> PRT 
35 <213> Streptococcus pneumoniae 



<400> 174 

Met Gly Phe Thr Glu Glu Thr Val Arg Phe Lys Leu Asp Asp Ser Asn 
15 10 15 

Lys Lys Glu lie Ser Glu Thr Leu Thr Asp Val Tyr Ala Ser Leu Asn 

20 25 30 



Asp Lys Gly Tyr Asn Pro lie Asn Gin He Val Gly Tyr Val Leu Ser 

45 • " 35 40 45 

Gly Asp Pro Ala Tyr Val Pro Arg Tyr Asn Asn Ala Arg Asn Gin He 

50 55 60 

50 Arg Lys Tyr Glu Arg Asp Glu He Val Glu Glu Leu Val Arg Tyr Tyr 

65 70 75 80 



Leu Lys Gly Gin Gly Val Asp Leu 

85 



<210> 175 
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<211> 198 

<212> PRT ' 

<213> Streptococcus pneumoniae 

5 <400> 175 ' ' < 

Met Val Asn Tyr Pro His Lys Val Ser Ser Gin Asp Arg. Gin Thr Sfer 
1 5 10 15 

i 

Leu Ser Gin Pro Lys Asn Phe Ala Asn Arg Gly Met, Ser Phe Glu Lys 
10 .20 . 25 , 30 

I* . . ! 

, I ' ■ * I 

Met lie Asn Ala Thr Asn Asp Tyr Tyr Leu Ser Gin Gly Leu Ala Val 

35 40 \ 1 i .45 

15 He His Lys Lys Pro Thr Pro He Gin He Val Gin Val Asp Tyr Pro 

50 55 ,60 



20 



35 



45 



50 



Gin Arg Ser Arg Ala Lys He Val Glu Ala Tyr Phe Arg Gin Ala Ser 
65 70 75 80 

Thr Thr Asp Tyr Ser Gly Val Tyr Asn Gly Tyr Tyr lie Asp Phe Glu 

1 85 90 95 



Val Lys Glu Thr Lys Gin Lys Arg Ala lie Pro Met Lys Asn Phe His 

.25 loo s 105 no 

Pro His Gin lie Gin His Met Glu Gin Val Leu Ala Gin Gin Gly lie 
115 120 ' 125 ' 

30 Cys Phe Val Leu Leu His Phe Ser Ser Gin Gin Glu Thr Tyr Leu Leu 
130 135 140 

Pro Ala Phe Asp Leu lie Arg Phe Tyr His Gin Asp Lys Gly Gin Lys 

145 150 155 160 



Ser Met Pro Leu Glu Tyr lie Arg Glu Tyr Gly Tyr Glu lie Lys Ala 

165' 170' 175 



Gly Ala Phe Pro Gin lie Pro Tyr Leu Asn Val lie Lys Glu His Leu 
40 "180 185 190 



Leu Gly Gly Lys Thr Arg 
195 



<2*10> 176 
<211>' 288 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 176 

Met Ala Leu Phe Ser Lys Lys Asp Lys Tyr lie Arg lie Asn Pro Asn 
15 10 15 



55 Arg Ser Val Arg Glu Lys Pro Gin Ala Lys Pro Glu Val Pro Asp Glu 

20 25 30 
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* 

Leu Phe Ser Gin Cys Pro Gly Cys Lys His Thr lie Tyr Gin Lys Asp 
35 \ 40 45 

Leu Gly Ser Glu ' Arg ( lie Cys Pro His Cys Ser Tyr Thr Phe Arg lie 
5 50 ' 55 1 i 60 

Ser Ala Gin Glu Arg Leu Ala Leu Thr lie Asp Met Gly Thr Phe Lys 
65 70 75 80 

a 

10 Glu Leu Phe Thr Gly lie ' Glu Ser Lys t Asp Pro Leu His Phe Pro ,Gly 

85' 90 , " ,95 



15 



30 



45 



50 



Tyr Gin Lys Lys Leu Ala Ser Met Arg Glu Lys Thr Gly Leu His Glu 

1 100 105 ' , 110 . 

Ala Val Val Thr Gly Thr Ala Leu lie Lys GlyGln Thr Val Ala Leu 
115 120 1 125 



Gly lie Met Asp Ser Asn Phe lie Met Ala Ser Met Gly Thr Val Val 
20 130 135 . 140 

Gly Glu Lys lie Thr Arg Leu Phe Glu Tyr Ala Thr Val' Glu Lys Leu 
145 150 155 160 

I 1 T 

25 Pro Val Val Leu Phe Thr Ala Ser Gly Gly Ala Arg Met Gin Glu Gly 

165 170 175 



lie Met Ser Leu Met Gin Met Ala Lys lie Ser Ala Ala Val Lys Arg 

180 185 190 

His Ser Asn Ala Gly Leu Phe Tyr Leu Thr lie Leu Thr Asp Pro Thr 
195 200 205 



Thr Gly Gly Val Thr Ala Ser Phe Ala Met' Glu Gly Asp lie lie Leu 
35 210 215 220 

Ala Glu Pro Gin Ser Leu Val Gly Phe Ala Gly Arg Arg Val lie Glu 
225 230 235 240 

40 Asn Thr Val Arg Glu Ser Leu Pro Glu Asp Phe Gin Lys Ala Glu Phe 

245 250 255 



Leu Leu Glu His Gly Phe Val Asp Ala lie Val Lys Arg Arg Asp Leu 

260 265 270 

Pro Asp Thr lie Ala Ser Leu Val Arg Leu His Gly Gly Ser Pro Arg 
275 * 280 285 



<210> 177 
<211> 139 
55 <212> PRT 

<213> Streptococcus pneumoniae 
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• ! 

<400> 177 

Met- Arg lie Met Gly Leu Asp Val Gly Ser Lys Thr Val Gly Val Ala 
I 5 10 15 

lie Ser Asp Pro Leu Gly Phe Thr Ala "Gin Gly Leu Glu lie lie Gin 1 

20 25 . ' ■ , < "30 , ' , 

lie Asn Glu Glu Gin Gly Gin Phe Gly Ser Asp Arg Val Lys Glu Leu 
35 40 45 

Val Asp Thr Tyr Lys Val Glu Arg Phe Val Val Gly Leu Pro Lys Asn 
50 55 i 60 



Met Asn Asn Thr' Ser Gly Pro Arg Val Glu Ala Ser Gin Ala Tyr Gly 
15 65 , 70 j 75 80 

Ala Lys Leu Glu Glu Phe Phe Gly Leu Pro Val Asp Tyr Gin Asp Glul r ■ 

85 90 95 

i 

20 Arg Leu Thr- Thr Val Ala Ala Glu Arg Met Leu lie Glu Gin Ala Asp 

100 : 105 110 



lie Ser Arg Asn Lys Arg Lys Lys Val lie Asp Lys Leu Ala Ala Gin 
115. 120 125 

* t 

Leu lie Leu Gin Asn Tyr Leu Asp Arg Lys Phe. 
130 135 



30 <210> 178 
<211> 398 
<212> PRT 

<213> Streptococcus pneumoniae 
35 <400> 178 

Met Ala Lys Leu Thr Val Lys Asp Val Asp Leu Lys Gly Lys Lys Val 
1 5 10 15 

Leu Val Arg Val Asp Phe Asn Val Pro Leu Lys Asp Gly Val lie Thr 
40 20 25 30 

Asn Asp Asn Arg lie Thr Ala Ala Leu Pro Thr lie Lys Tyr lie lie 
35 40 45 

45 Glu Gin Gly Gly Arg Ala lie Leu Phe Ser His Leu Gly Arg Val Lys 

50 55 60 



Glu Glu Ala Asp Lys Ala Gly Lys Ser Leu Ala Pro. Val Ala Ala Asp 
65 70 75 80 

Leu Ala Ala Lys Leu Gly Gin Asp Val Val Phe Pro Gly Val Thr Arg 

85 '90 95 



Gly Ala Glu Leu Glu Ala Ala lie Asn Ala Leu Glu Asp Gly Gin Val 

55 ioo 105 no 

Leu Leu Val Glu Asn Thr Arg Tyr Glu Asp Val Asp Gly Lys Lys Glu 
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115 120 125 , 

Ser Lys Asn Asp Pro Glu Leu Gly Lys Tyr Tip Ala Ser Leu Gly Asp 

i 130 135 . ' 140 ' 

Gly lie Phe Val Asn Asp Ala Phe Gly Thr Ala His 1 Arg Ala His' Ala ' 
1.45 150 155 160 



Ser Asn Val Gly lie Ser Ala Asn Val Glu Lys Ala Val Ala Gly Phe 
10 165 170 , • 175 1 

Leu Leu Glu Asn Glu lie Ala Tyr lie Gin Glu Ala Val Glu Thr Pro' 

180, ! 185 190 . 



20 



35 



50 



t * 

15 Glu Arg Pro Phe Val Ala lie Leu Gly Gly Ser Lys Val Ser Asp Lys 

195 200 205 



i.' > 



lie- Gly Val lie Glu Asn Leu Leu Glu Lys Ala Asp Lys Val Leu lie 
210 215 220 

Gly Gly Gly Met Thr Tyr Thr Phe Tyr Lys Ala Gin Gly lie Glu lie 
225 230 1 235 240 



Gly Asn Ser Leu Val uiu Glu Asp Lys Leu Asp Val Ala Lys Ala Leu 
25 245 ,250 255 

Leu Glu Lys Ala Asn Gly Lys Leu lie Leu Pro Val Asp Ser Lys Glu 

260 265 270 

i 

30 Ala Asn Ala Phe Ala Gly Tyr Thr Glu Val Arg Asp Thr Glu Gly Glu 

275 280 285 



Ala Val Ser Glu Gly Phe Leu Gly Leu Asp lie Gly Pro Lys Ser lie 
290 295 300 

* 

Ala Lys Phe Asp Glu Ala Leu Thr Gly Ala Lys Thr Val Val Trp Asn 
305 310 315 320 



Gly Pro Met Gly Val Phe Glu Asn Pro Asp Phe Gin Ala Gly Thr lie 
40 325 330 335 

Gly Val Met Asp Ala He Val Lys Gin Pro Gly Val Lys Ser He He 

340 345 350 

45 Gly Gly Gly Asp Ser Ala Ala Ala Ala He Asn Leu Gly Arg Ala Asp 

355 360 365 



Lys Phe Ser Trp He Ser Thr Gly Gly Gly Ala Ser Met Glu Leu Leu 
370 375 380 

Glu Gly Lys Val Leu Pro Gin Leu Ala Ala Leu Thr Glu Lys 
385 .390 395 



55 <210> 179 
<211> 165 
<212> PRT 
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i , » 

<213> Streptococcus pneumoniae 
<400> 179 

Met Leu Lys Ser Glu Lys Gin Ser Arg Tyr Gin Met Leu Asn Glu Glu 

5 i ,5 > ,10 , ■ is 

i 1 i 

Leu Ser Phe Leu Leu Glu Gly Glu Thr Asn Val Leu Ala Asn Leu Ser 

20 25 30 

10 Asn Ala Ser Ala Leu lie .Lys Ser Arg Phe Pro Asn Thr Val Phe Ala 

35 1 40 1 . ,r 45 • ■ » 



15 



30 



55 



i 

Gly Phe Tyr Leu Phe Asp Gly Lys .Glu Leu Val Leu Gly Pro Phe Gin 

50 f ' 55 '* , 60 

Gly Gly Val Ser Cys lie Arg lie Ala. Leu Gly Lys Gly Val Cys Gly 
65 70 , 75 80 



Glu Ala Ala His Phe Gin Glu Thr Val lie Val Gly Asp Val Thr Thr 
20 8*5 90 , 95 

Tyr Leu Asn Tyr lie Ser Cys Asp Ser Leu Ala Lys Ser Glu lie Val 

100 105 110 

25 Val Pro Met Met Lys Asn Gly Gin Leu Leu Gly Val Leu Asp Leu Asp 

115 120 125 



Ser Ser Glu lie Glu Asp Tyr Asp Ala Met Asp Arg Asp Tyr Leu Glu 
130 135 140 

Gin Phe Val Ala lie Leu Leu Glu Lys Thr Ala Trp Asp Phe Thr Met 
145 150 155 160 



Phe Glu Glu Lys Ser 
35 165 



<21Q> 180 ' 
<211> 209 
40 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 180 

Met Thr lie Glu Leu Leu Thr Pro Phe Thr Lys Val Glu Leu Glu Pro 
45 1 5 10 15 

Glu lie Lys Glu Lys Lys Arg Lys Gin Val Gly lie Leu Gly Gly Asn 

20 25 30 

50 Phe Asn Pro Val His Asn Ala His Leu lie Val Ala Asp Gin Val Arg 

35- 40 45 



Gin Gin Leu Gly Leu Asp Gin Val Leu Leu Met Pro Glu Tyr Gin Pro 
50 ' 55 60 

Pro His Val Asp Lys Lys Glu Thr lie Pro Glu His His Arg Leu Lys 
65 70 75 80 
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10 



50 



Met Leu Glu Leu Ala lie . Glu Gly lie Asp Gly Leu Val lie Glu Thr 

85 90 95 

lie Glu Leu Glu Arg Lys Gly lie Ser Tyr Thr Tyr Asp Thr Met Lys 

100 105 1 110 . ' < 

lie Leu Thr Glu Lys Asn Pro Asp Thr Asp Tyr Tyr Phe lie lie Gly 
115 120 , 125 

Ala Asp Met Val Asp Tyr Leu Pro Lys Trp Tyr -Arg lie Asp' , Glu Leu 
130 135 140 



Val Asp Met Val Gin Phe Val Gly Val Gin Arg Pro Arg Tyr Lys Val 

15 145 i 150 155' t 160 

Gly Thr Ser Tyr Pro Val lie Trp Val Asp Val Pro Leu Met Asp lie 

165 11Q 175 



185 { 190 



205 



20 


Ser Ser Ser Met Val 


Arg Ala Phe 






180 






Phe Leu Leu Pro Gin 


Pro Val Leu 






1 r\ r- 


200 


25 




i 






Tyr 






30 


<210> 


181 






<211> 


255 






<212> 


PRT 






<213> 


Streptococcus 


pneumoniae 


35 


<400> 


181 






Met Asn lie Ala Lys. 


lie Val Arg 




. 1 


5 





10 15 

Thr Thr Leu Asp Phe Ala Thr Gly He Phe Asp Glu Phe He Gin Leu 
40 20 25 30 

* * 

His Gly Asp Arg Ser Phe Arg Asp Asp' Gly Ala Val Val Gly Gly He 
35 40 45 

45 Gly Trp Leu Gly Asp. Gin Ala Val Thr Val Val Gly He Gin Lys Gly 

50 55 60 



Lys Ser Leu Gin Asp Asn Leu Lys Arg Asn Phe Gly Gin Pro His Pro 
65 70 75 80 

Glu Gly Tyr Arg Lys Ala Leu Arg Leu Met Lys Gin Ala Glu Lys Phe 

85 90 95 



Gly Arg Pro Val Val Thr Phe He Asn Thr Ala Gly Ala Tyr Pro Gly 
55 100 105- 110 

Val Gly Ala Glu Glu Arg Gly Gin Gly Glu Ala He Ala Arg Asn Leu 



123 



INSDOCID: <WO 0149721A2J_> 



WO 01/49721 PCTYUS00/35604 



115 120 125 . 

Met Glu Met Ser Asp Leu Lys Val Pro lie lie Ala lie lie lie Gly 
130 135 1 140 

Glu Gly Gly Ser Gly Gly Ala Leu Ala Leu Ala Val 'Ala Asp Arg Val 
145 150 155 160 

Trp Met Leu Glu Asn Ser lie Tyr Ala lie Leu Ser Pro Glu Gly Phe 
10 165 i 170 , 175 1 

Ala Ser He Leu Trp Lys Asp Gly Thr Arg Ala Met Glu Ala Ala Glu' 

180 i* 185 190 

1 , 

15 Leu Met Lys He Thr Ser His Glu Leu Leu Glu Met Asp Val Val Asp 

195 200 205 



20 



35 



45 



Lys Val He Ser Glu Val Gly Leu Ser Ser Lys Glu Leu He Lys Ser 
210 215 220 

Val Lys Lys Glu Leu Gin Thr Glu Leu Ala Arg Leu Ser Gin Lys Pro 
225 230 235 240 



Leu Glu Glu Leu Leu Glu Glu Arg Tyr Gin Arg Phe Arg Lys Tyr 
25 245 250 255 



<210> 182 
<211> 169 
30 <212> PRT 

<213> Streptococcus pneumoniae 



<400> 182 



Met He He Lys Val Glu Met Ala Asp Val Glu Val Leu Ala Lys He 
15 10 15 

Ala Lys Gin Thr Phe Arg Glu Thr Phe Ala Tyr Asp Asn Thr Glu Glu 

20 25 30 

40 Gin Leu Gin Glu Tyr Phe Glu Glu Ala Tyr Ser Leu Lys Thr Leu- Ser 

35 40 45 



Thr Glu Leu Gly Asn Pro Asp Ser Glu Thr Tyr Phe lie Met His Glu 
50 55 60 

Glu Glu He Ala Gly Phe Leu Lys Val Asn Trp Gly Ser Ala Gin Thr 
65 70 75 80 



-Glu Arg Glu Leu Glu Asp Ala Phe Glu lie Gin Arg Leu Tyr Val Leu 
50 85 90 95- 

Gin Lys Phe Gin Gly Phe Gly Leu Gly Lys Gin Leu Phe Glu Phe Ala 

100 105 110 

55 Leu Glu Leu Ala Thr Lys Asn Ser Phe Ser Trp Ala Trp Leu Gly Val 

115 120 125 
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i . 
I 

Trp Glu His Asn Thr Lys Ala Gin Ala Phe Tyr Asn Arg Tyr Gly Phe 
130 135 140 

i Glii Lys Phe'Ser Gin His His Phe Met Val Gly Gin Lys Val Asp Thr ' i 

5 145 150 155 160 

i t 

i . » 

Asp Trp Leu Leu Arg Lys Lys' Leu' Arg 

165 



10 



15 



25 



40 



55 



<210> 183 
<211> 529 
<212> PRT 

<213> Streptococcus pneumoniae 

i 

<400> 183 ! 

Met Leu Arg Gly Thr Ala Leu Leu Thr Ala Ser Asn Phe lie Ser Arg 
1 5 10 15 



20 Leu Leu Gly Ala Val Tyr lie lie Pro Trp Tyr He Trp Met Gly Ala 

20 25 30 



Tyr Ala Ala Lys Ala Asn Gly Leu Phe Thr Met Gly Tyr Thr He Tyr 
35 4 0 45 

Ala Trp Phe Leu Leu Val Ser Thr Ala Gly He Pro Val Ala Val Ala 
50 55 60 



Lys Gin Val Ala Lys Tyr Asn Thr Met Arg Glu Glu Glu His Ser Phe 
30 65 70 75 80 

+ 

Ala Leu He Arg Ser Phe Leu Gly Phe Met Thr Gly Leu Gly Leu Val 

85 90 95 

35 Phe Ala Leu Val Leu Tyr Val Phe Ala Pro Trp Leu Ala Asp Leu Ser 

100 . 105 110 



Gly Val Gly Lys Asp Leu He Pro He Met Gin Ser Leu Ala Trp- Gly 
115 120 125 

Val Leu lie Phe Pro Ser Met Ser Val He Arg Gly Phe Phe Gin Gly 
130 135 140 



Met Asn Asn Leu Lys Pro Tyr Ala Met Ser Gin He Ala Glu Gin Val 
45 145 150 155 160 

lie Arg Val He Trp Met Leu Leu Ala Thr Phe He lie Met Lys Leu 

165 170 175 

50 Gly Ser Gly Asp Tyr Leu Ala Ala Val Thr Gin Ser Thr Phe Ala Ala 

180 185 190 



Phe Val Gly Met Val Ala Ser Phe Ala Val Leu He Tyr Phe Leu Ala 
195 200 205 

Gin Glu Ser Ser Leu Lys Arg Val Phe Glu Thr Gly Asp Lys He Asn 
210 215 220 
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25 



40 



50 



55 



Ser Lys Arg Leu Leu Val. Asp Thr lie Lys Glu Ala lie Pro Phe lie 
225 230 235 240 



! i 



Leu Thr Gly Ser Ala lie Gin lie Phe Gin lie 'Leu Asp Gin Leu Thr 

i 245 250' • 255 . 

Phe lie Asn Ser Met Ser Trp Phe Thr Asn Tyr Ser Asn Glu Asp Leu 

260 265 270 

i * 1 
Val Val Met Phe .Ser Tyr Phe Ser Ala Asn Pro Asn Lys He' , Thr Met 

. 275 280 285 



lie teu lie' Ser Val Gly Val Ser lie Gly Ser Val Gly Leu Pro Leu 
15 290 1 , 295 300 

Leu Thr Glu Asn Tyr Val Lys Gly Asp Leu Lys Ala Ala Ser Arg Leu 
305 310 315 320 

20 Val Gin Asp Ser Leu Thr Leu Leu Phe Met Phe Leu Leu Pro Ala Thr 

325 330 335 



Val Gly Val Val Met Val Gly Glu Pro Leu Tyr Thr Val Phe Tyr Gly 

340 345 350 

Lys Pro Asp Ser Leu Ala Leu Gly Leu Phe Val Phe Ala Val Leu Gin 
355 360 365 



Ser lie lie Leu Gly Leu Tyr Met Val Leu Ser Pro Met Leu Gin Ala 
30 370 375 380 

Met Phe Arg Asn Arg Lys Ala Val Leu Tyr Phe lie Tyr Gly Ser lie 
385 390 395 400 

35 Ala Lys Leu Val Leu Gin Leu Pro Thr lie Ala Leu Phe His Ser Tyr 

405 410 415 



Gly Pro Leu lie Ser Thr Thr lie Ala Leu lie lie Pro Asn Val Leu 

420 425 430 

Met Tyr Arg Asp lie Cys Lys Val Thr Gly Val Lys Arg Lys Val lie 
435 ' 440 445 



Leu Lys Arg Thr lie Leu lie Ser Leu Leu Thr Leu Val Met Phe Leu 
45 450 455 460 



Leu lie Gly Thr lie Gin Trp Leu Leu Gly Phe Phe Phe Gin Pro Ser 
465 470 475 480 

Gly Arg Leu Trp Ser Phe Phe Tyr Val Ala Leu Val Gly Ala Met Gly 

485 490 495 

Gly Gly Leu Tyr Met Val Met Ser Leu Arg Thr Tyr Leu Leu Asp Lys 

500 505 510 

Val lie Gly Lys Ala Gin Ala Asp Arg Leu Arg Ala Lys Phe Lys Leu 
515 520 525 
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<210> 184 ' , ' , . ' , 

<211> 155 i • 

<212> PRT . 

<213> Streptococcus pneumoniae 

<400> 184 • ■ ' , 

Met Ser Asp Lys lie Gly Leu Phe Thr Gly Ser Phe Asp Pro j Met Thr 
1 5 I 10 ' 15 



15 Asn- Gly His Leu Asp lie lie Glu Arg Ala Ser' Arg Leu Phe Asp Lys 

20 25 30 



Leu Tyr Val Gly lie Phe Phe Asn Pro His Lys Gin Gly Phe Leu Pro 

35 40 45 

lie Glu Asn Arg Lys Arg Gly Leu Glu Lys Ala Leu Gly f His Leu Glu' 

50 1 55 60 



Asn Val Glu Val Val Ala Ser His Asp Glu Leu Val Val Asp Val Ala 
25 65 70 75 80 

Lys Arg Leu Gly Ala Thr Cys Leu Val Arg Gly Leu Arg Asn Ala Ser 

85 90 95 

30 Asp Leu Gin Tyr Glu Ala Ser Phe Asp Tyr Tyr Asn His Gin Leu Ser 

100 105 110 



Ser Asp lie Glu Thr lie Tyr Leu His Ser Arg Pro Glu His Leu Tyr 
115 120 125 

lie Ser Ser Ser Gly Val Arg Glu Leu Leu Lys Phe Gly Gin Asp lie 
130 135 140 



Ala Cys Tyr Val Pro Glu Ser lie Trp Arg Lys 
40 145 150 155 



<210> 185 
<211> 143 
45 <212> PRT 

<213> Streptococcus pneumoniae 

* 

<40*0> 185 

Met Thr lie Leu Phe Val Val lie Ser Ala Ser Phe Leu Tyr Met Val 
50 1 5 ' . 10 15 

Ser Leu Ser Met Lys Pro Tyr Gin Thr Ala Lys Ser Glu. Gly Glu Lys 

20 * 25 30 

55 Leu Ala Gin Gin Tyr Ala Gly Leu Glu Gin Ala Asp Gin Val Asp Leu 

35 40 45 
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! '■ ' 

Tyr Asn Gly Leu Glu Ser Tyr Tyr Ser Val Leu Gly Arg Asn Lys Gin 

50 55 ! 60 

Gin Glu Ala Leu Ala Val Leu lie Gly Lys Asp Asp His , Lys lie Tyr .. 

5' 65 , 70 75 " . ,80 \ 1 

i 

Val Tyr Gin Leu Asn Gin Gly Val Ser Gin Glu Lys Ala Glu Thr Vai 

85 90 95 

10 Ser Lys Glu Lys Gly Ala Gly Glu lie Asp Lys lie lie Phe Gly Arg 

100 105 ' 110 



15 



40 



55 



i 

Tyr Gin Asp Lys Pro lie Trp Glu Val Lys Ser Gly Ser Asp Phe, Tyr 
115 120 125 ' 

! 

Leu Val Asp Phe Glu Thr Gly Ala Leu Val Asn Lys Glu Gly Leu 

130 135 140 i 



20 <210> 186 
<211> 243 
<212> PRT 

<213> Streptococcus pneumoniae 

25 <400> 186 ! 1 

Met lie Asp lie His Ser His lie Val Phe Asp Val Asp Asp Gly Pro 
15 10 15 

Lys Ser Arg Glu Glu Ser Lys Ala Leu Leu Thr Glu Ala Tyr Arg Gin 
30 20 25 30 

Gly Val Arg Thr lie Val Ser Thr Ser His Arg Arg Lys Gly Met Phe 
35 40 45 

35 Glu Thr Pro Glu Glu Lys lie Ala Glu Asn Phe Leu Gin Val Arg Glu 

50 55 60 



lie Ala Lys Glu Val Ala Ser Asp . Leu Val lie Ala Tyr Gly Ala Glu 

65 70 75 80 

lie Tyr Tyr Thr Pro Asp Val Leu Asp Lys Leu Glu Asn Asn Arg lie 

85 90 95 



Pro Thr Leu Asn Asn Ser Arg Tyr Ala Leu lie Glu Phe Ser Met Asn 
45 100 105 110 

Thr Pro Tyr Arg Asp lie His Ser Ala Leu Asn Lys lie Leu Met Leu 
115 120 125 

50 Gly lie Thr Pro Val lie Ala His lie Glu Arg Tyr Asp Val Leu Glu 
130 135 140 



Asn Asn Glu Lys* Arg Val Arg Glu Leu lie Asp Met Gly Cys Tyr Thr 
145 150 155 160 

Gin lie Asn Ser Ser His Val Leu Lys Ser Lys Leu Phe Gly 'Glu Pro 

165 170 175 
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10 



15 



35 



50 



Tyr Lys Phe Met Lys Lyg Arg Ala Gin Tyr Phe Leu Glu Arg Asp Leu 

180 185 190 



Val His lie lie Ala Ser Asp Met Hi's Asn Val Asp Gly Arg Pro Pro 
195 * 200 . ■ . ' . 205. ' , 

i 1 
His Met Ala Glu Ala Tyr Asp Leu Val Ser Gin Lys Tyr Gly Glu Ala 
210 215 220, 

i 1 
Lys Ala Gin Glu Leu Phe lie Asp Asn Pro, &rgLys lie Val' Met Asp 

225 230 235. 240 ' 

1 . • i . • 

Gin Leu lie 



<210> 187 ( 
<211> 308 
20 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 187 

Met Ser Thr lie Asp Lys Glu Lys Phe Gin Phe Val Lys Arg Asp Asp 
25 1 ,5 10 15 



Phe Ala Ser Glu Thr lie Asp Ala Pro Ala Tyr Ser Tyr Trp Lys Ser 

20 * 25 30 

i 

30 Val Phe Lys Gin Phe Met Lys Lys Lys Ser Thr Val Val Met Leu Gly 

35 40 45 



lie Leu Val Ala He He Leu He Ser Phe He Tyr Pro Met Phe Ser 
50 55 60 

Lys Phe Asp Phe Asn Asp Val Ser Lys Val Asn Asp Phe Ser Val Arg 
65 70 75 80 



Tyr He Lys Pro Asn Ala Glu His Trp Phe Gly Thr Asp Ser Asn Gly 
40 85 90 95 

Lys Ser Leu Phe Asp Gly Val Trp Phe Gly Ala Arg Asn Ser lie Leu 

100 105 110 

45 . He Ser Val He Ala Thr Val He Asn Leu Val He Gly Val Phe Val 

115 120 125 



Gly Gly He Trp Gly He Ser Lys Ser Val Asp Arg Val Met Met Glu 
130 135 140 

Val Tyr Asn Val He Ser Asn He Pro Pro Leu Leu He Val He Val 

145 150 155 160 



Leu Thr Tyr Ser He Gly Ala Gly Phe Trp Asn Leu He Phe Ala Met 
55 165 170 175 

Ser Val Thr Thr Trp He Gly He Ala Phe Met He Arg Val Gin He 
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t 

180 185 190 

t < i [ 

Leu Arg Tyr Arg Asp Leu Glu Tyr Asn Leu Ala Ser Arg Thr Leu Gly 
195 200 205 , 



Thr Pro Thr Leu Lys lie Val Ala Lys Asn lie Met i Pro Gin Leu Val 
210 . 215 220' ' 



Ser Val lie Val Thr Thr Met Thr Gin Met Leu Pro Ser Phe lie Ser 

10 225 230 235 240 

i 1 1 

Tyr Glu Ala Phe Leu Ser Phe Phe Gly Leu Gly Leu Pro lie Thr Val 

245 | 250 255, 

i * * 

I 

15 Pro Ser Leu Gly Arg Leu lie Ser Asp Tyr Ser Gin Asn Val Thr Thr 

♦ 260 1 265 270 



Asn Ala Tyr Leu Phe Trp lie Pro Leu Thr Thr Leu Val Leu Val Ser 
275 280 285 

Leu Ser Leu Phe Val Val Gly Gin Asn Leu Ala Asp Ala Ser Asp Pro 
290 295 300 



Arg Thr His Arg 
25 305 



<210> 188 
<211> 77 
30 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 188 

Met Tyr Asn Leu Leu Leu Thr lie Leu Leu Val Leu Ser Val Val lie 
35 1 , 5 .10 15 

Val lie Ala lie Phe Met Gin Pro Thr Lys Asn Gin Ser Ser Asn Val 

20 25 30 

40 Phe Asp Ala Ser Ser Gly Asp Leu Phe Glu Arg Ser Lys Ala Arg Gly 

35 40 45 



Phe Glu Ala Val Met Gin Arg Leu Thr Gly lie Leu Val Phe Phe Trp 
50 55 60 

Leu Ala lie Ala Leu Ala Leu Thr Val Leu Ser Ser Arg 
65 70 75 



50 <210> 189 
<211> 369 
<212> PRT 

<213> Streptococcus pneumoniae 
55 <400> 189 

Met Phe Arg Arg Asn Lys Leu Phe Phe Trp Thr Thr Glu lie Leu Leu 
15 10 15 



i 



130 



10 



25 



40 
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Leu Thr lie lie Phe Tyr Leu Trp Arg Gin Met Gly Ser Leu lie Asn 

20 25 30 

Pro Phe Val Ser Val Leu Asn Thr lie Met lie Pro Phe Leu Leu Gly 
35 40 ' ' 45 , 

> 

Gly Phe Leu Tyr Tyr Leu Thr Asn Pro lie Val Thr Phe Leu Asn Lys 
50 55 60 

i • 

Val Cys Lys Leu Asn Arg Leu Leu Gly lie Leu lie Thr Leu Cys Thr 



65 70 



75- 80 



Leu Val Trp Gly Met Val lie Gly Val Val Tyr Leu Leu Pro lie Leu 
15 ' , 85 i 90 95 

lie Asn Gin Leu Ser Ser Leu lie lie Ser Ser Gin Thr lie Tyr Ser 1 

100 ; 105 . 110 

20 Arg Val Gin Asp Leu lie lie Asp Leu Ser Asn Tyr Pro Ala Leu Gin 

115 120 125 



i • 



Asn Leu Asp Val Glu Ala Thr lie Gin Gin Leu Asn Leu Ser Tyr Val 



i -an 

j. «j \j 



135 i , 140 



Asp lie Leu Gin Asn lie Leu Asn Ser Val Ser Asn Ser Val Gly Ser 
145 150 155 160 



Val Leu Ser Ala Leu lie Ser Thr Val Leu lie Leu lie Met Thr Pro 

30 165 170 175 

Val Phe Leu Val Tyr Phe Leu Leu Asp Gly His Lys Phe Leu Pro .Met 

180 185 190 

35 Leu Glu Arg Thr lie Leu Lys Arg Asp Arg Leu His lie Ala Gly Leu 

195 200 205 



Leu Lys Asn Leu Asn Ala Thr lie Ala Arg Tyr lie Ser Gly Val Ser 
.210 215 220 

lie Asp Ala lie lie lie Gly Cys Leu Ala Tyr lie Gly Tyr Ser lie 

225 230 235 240 



lie Gly Leu Lys Tyr Ala Leu Val Phe Ala lie Phe Ser Gly Val Ala 

45 245 250 255 

Asn Leu He Pro Tyr Val Gly Pro Ser He Gly Leu He Pro Met He 

260 265 270 

50 ' He Ala Asn He Phe Thr Val Pro His Arg Leu Leu He Ala Val He 

275 280 285 



Tyr Met Leu Val Val Gin Gin Val Asp Gly Asn He Leu Tyr Pro Arg 
290 295 300 

He Val Gly Ser Val Met Lys Val His Pro He Thr He Leu Val Leu 
305 310 315 320 
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10 



35 



50 



Leu Leu Leu Ser Ser Asn lie Tyr Gly Val Val Gly M t lie Val Ala 

325 330 , 335 

Val Pro Thr Tyr Ser lie Leu Lys Glu lie Ser Lys Phe Leu Ser Arg 

340 , 345 350 

* t * i i 

Leu Tyr Glu Asn His Lys lie Met Lys Glu Arg Glu Arg Glu Leu Ala 
355 360 365- 

Lys 1 ' 



15 <210> 190 ' 
<211> 451 1 
<212> PRT 

<213> ' Streptococcus pneumoniae 
20 <400> 190 

Met Tyr Gin Ala Leu Tyr Arg Lys Tyr Arg Ser Gin Asn Phe Ser Gin 
1 5 10 15 

Leu Val Gly Gin Glu Val Val Ala Lys Thr Leu Lys Gin Ala Val Glu 
25 20 25 30 

Gin Glu Lys lie Ser His Ala Tyr Leu Phe Ser Gly Pro Arg Gly Thr 
35 40 45 

30 Gly Lys Thr Ser Val Ala Lys lie Phe Ala Lys Ala Met Asn Cys Pro 

50 55 60 



Asn Gin Val Gly Gly Glu Pro Cys Asn Asn Cys Tyr He Cys Gin Ala 
65 70 75 80 

Val Thr Asp Gly Ser Leu Glu Asp Val He Glu Met Asp Ala Ala Ser 

85 90 95 



Asn Asn Gly Val Asp Glu He Arg Glu He Arg Asp Lys Ser Thr Tyr 
40 100 105 110 

Ala Pro Ser Leu Ala Arg Tyr Lys Val Tyr He lie Asp Glu Val His 

115 120 125 

45 Met Leu Ser Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu 

130 135 140 



Pro Thr Gin Asn Val Val Phe He Leu Ala Thr Thr Glu Leu His Lys 

145 150 • 155 160 

He Pro Ala Thr He Leu Ser Arg Val Gin Arg Phe Glu Phe Lys Ser 

165 170 175 



lie Lys Thr Gin Asp He Lys Glu His He His Tyr lie Leu Glu Lys 
55 180 185 190 

Glu Asn lie Ser Ser Glu Pro Glu Ala Val Glu He He Ala Arg Arg 
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195 , 2Q0 205 ; . 

Ala Glu Gly Gly. Met Arg Asp Ala Leu Ser, lie Leu Asp Gin Ala. Leu 
210 1 215 220 

5 ' ■ . . ■ , ( 

Ser Leu Thr Glri Gly Asn Glu Leu Thr Thr Ala' lie Ser Glu Glu lie 
225 ' 230 235 240 

Thr Gly Thr lie Ser Leu ( Ser Ala Leu Asp Asp Tyr Val Ala Ala Leu 
10 245, 250 / ( 255 ' 

Ser Gin Gin Asp Val Pro Lys Ala Leu Ser Cys Leu' Asn Leu" Leu Phe 
, j .260 . '263. ■ 270 

r 

15 Asp Asn Gly Lys Ser Met Thr Arg Phe Val Thr " Asp Leu Leu His Tyr 

275 280 285 



20 



35 



50 



Leu Arg Asp Leu Leu lie Val Gin Thr Gly Gly Glu Asn Thr His His 
290 295 300 

Ser Ser Va^ Phe Val Glu Asn Leu Ala Leu Pro Gin Lys. Asn Leu Phe 
305 310 315 320 



. Glu Met lie Arg Leu Ala Thr Val Ash Leu Ala Asp lie Lys Ser Ser 
25 325 330 335 

Leu Glh Pro Lys lie Tyr Ala Glu Met Met Thr 'Val Arg Leu Ala Glu 

340 345 350 

30 lie Lys Pro Glu Pro Ala Leu Ser Gly Ala Val Glu Asn Glu lie Ala 

355 360 365 



Thr Leu Arg Gin Glu Val Ala Arg Leu Lys Gin Glu Leu Ser Asn Ala 
370 375 380 

* 

Gly Ala Val Pro Lys Gin Val Ala Pro Ala • Pro Ser Arg Pro Ala Thr 
385 390 395 400 



Gly Lys Thr Val Tyr Arg Val Asp Arg Asn Lys Val Gin Ser He Leu 

40 405 410 415 

Gin Glu Ala Val Glu Asn Pro Asp Leu Thr Arg Gin Asn Leu He Arg 

420 425 430 

45 Leu Gin Asn Ala Trp Gly Glu Val He Glu Ser Leu Gly Gly Pro Asp 

435 440 445 



Lys Leu Cys 
450 



<210> 191 

<211> 662 

<212> PRT 

55 <213> Streptococcus pneumoniae 

<400> 191 
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* 

Met Phe Arg Leu Thr Asn Lys Leu Ala Val Ser Asn Leu lie Lys Asn 
.1 5 10 15 

Arg Lys Leu Tyr Tyr Pro Phe Ala Leu Ala Val L u. Leu, Ala Val Thr 
5' 20 25 ' ' , . 30 1 

' * i ' 

Leu Thr Tyr Leu Phe Tyr Ser Leu Thf Phe Asn Pro Lys He Ala Glu' 
35 40 45 

10 He Arg Gly 'Gly Thr 'Thr He Gin Ala Thr Leu Gly Phe Gly Met Phe 

50 55 60 



15 



30 



45 



Val Val Thr Leu Ala Ser Ala He He Val Leu Tyr Ala Asn Ser, Phe 

65 70 j 75 80 

i 

i 1 ' , 

Val Met Lys Lys Arg Ser Lys Glu Leu Gly He Tyr Gly Met Leu Gly 

85 90 95 I 



Leu Glu Lys Arg His Leu He Ser Met Thr Phe Lys Glu Leu Val Val 
20 100 105 110 

1 i 

Phe Gly He Leu Thr Val Gly Ala Gly He Gly He Gly Ala Leu Phe 
115 120 125 

_ j 

25 Asp Lys Leu He Phe Ala Phe Leu Leu Lys Leu Met Lys Leu Lys Val 
130 135 140 



Glu Leu Val Ala Thr Phe Gin Thr Lys Val Val He Thr Val Leu Val 
145 150 155 160 

* 

Val Phe Gly Leu lie Phe Leu Gly Leu Met Phe Leu Asn Ala Leu Arg 

165 170 175 



He Ala Arg Met Asn Ala Leu Gin Leu Ser Arg Glu Lys Ala Ser Gly 
35 180 185 190 

Glu Lys Lys Gly Arg Phe Leu Pro Leu Gin Thr He Leu Gly Ser He 
195 200 205 

40- Ser Leu Gly He Gly Tyr Tyr Leu Ala Leu Thr Val Lys Asp Pro Leu 
210 215 220 



Thr Ala Leu Thr Thr Phe Phe He Ala Val Leu Leu Val He Phe Gly 
225 230 235 240 

* 

Thr Tyr Leu Leu Phe Asn Ala Gly He Thr Val Phe Leu Gin He Leu 

245 250 255 



Lys Lys Asn Lys Lys Tyr Tyr Tyr Gin Pro Asn Asn Leu He Ser Val 
50 260 265 270 

Ser Asn Leu He Phe Arg Met Lys Lys Asn Ala Val Gly Leu Ala Thr 
275 280 . 285 

55* He Ala He Leu Ser Thr Met Val Leu Val Thr Met Ser Ala Ala Thr 
290 295 300 
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i 

i 

Ser lie Phe Asn Ser Ala Glu Ser Phe Lys Lys Val Leu Asn Pro His 
305 310 315 320 



, Asp Phe Gly Val Ser Gly Gin Asn Val Glu Lys Glu Asp Leu Asp Lys 

5 325 330 335 ' , 

i t i 

Leu Leu Ser Gin Phe Ala Ser Asp Asn Gly Tyr Lys lie Lys Glu Lys 

' 340 345 ' 350 

10 Glu Val Phe Arg Tyr Thr Tyr Phe Gly Val Ala Asn Gin Glu Gly Asn 

355 360 ' 365 



Lys Leu Thr Phe Phe Glu Lys Gly Gin Asn Arg Val Gin Pro Thr • Thr 
370 375 1 380 

r i ' 

Val Phe Met Val Phe Asp Gin Lys Asp Tyr Glu Asri Met Thr Gly Gin 
385 390 395 400 1 



Lys Leu Ser Leu Ser Gly Asn Glu Val Gly Leu Phe Ala Lys Asn Asp 
20 405 410 415 

Gly Leu Lys Gly Gin Lys Thr Leu lie Leu Asn Asp His Gin Phe Ser 

420 425 430 

25 Val Lys Glu Glu Phe Asn Lys Asp Phe lie Val Asn His Val Pro Asn 

435 440 445 



Gin Phe Asn lie Leu Thr Ala Asp Tyr Asn Tyr Leu Val Val Pro Asp 
450 455 460 

Leu Gin Ala Phe Leu Asn Gin Phe Pro Asp Ser Asp lie Tyr Asn Gin 
465 470 475 480 



Phe Tyr Gly Gly Met Asn Val Asn Val Ser Glu Glu Glu Gin Leu Lys 
35- 485 490 495 

Val Ala Glu Glu Tyr Glu Asn Tyr Leu Asn Gin Phe Asn Ala Gin Leu 

500 505 510 

40 Asp Thr Glu Gly Ser Tyr Val Tyr Gly Ser Asn Leu Ala Asp Ala Ser 

515 520 525 



Ser Gin Met Ser Ala Leu Phe Gly Gly Val Phe Phe lie Gly lie Phe 
530 535 540 

Leu Ser lie lie Phe Met Val Gly Thr Val Leu Val lie Tyr Tyr Lys . 
545 550 555 560 • 



Gin He Ser Glu Gly Tyr Glu Asp Arg Glu Arg Phe He He Leu Gin 
50 565 570 575 

Lys Val Gly Leu Asp Gin Lys Gin He Lys Gin Thr lie His Lys Gin 

580 585 590 

55 Val Leu Thr Val Phe Phe Leu Pro Leu Leu Phe Ala Phe lie His Leu 
- • 595 600 605 



i 



135 



iNSDOCID: <WQ 01 49721 A2 \ > 



WO 01/49721 PCT/US00/35604 

Ala Phe Ala Tyr His Met Leu Ser Leu He Leu Lys Val He Gly Val . 

610 . 615 - 620 

Leu Asp Thr Thr Met Met Leu He Val Thr Leu Ser. lie Cys Ala lie 

625 ■ , ' 630 i , 635 \ '< ' 640 

Phe Leu He Ala Tyr Val Leu He pHe Met He Thr Ser Arg Ser Tyr 

64$ 650 655 

Arg Lys He Val Gin Met ( , , 

660 ' . , * ■ 



<210> 192 i ' ■ , 

<211> 296 ' 

<212> PRT ' * ■ 

i i * 

<213> Streptococcus pneumoniae 

<400> 192 '."*.• 

Met Lys Gin Asp Gin Leu Lys Ala Trp Gin Pro Ala Glh Phe Asp Arg 
1 5 10 15 

i 

Phe Val Arg He Leu Glu Gin Asp Gin Leu Asn His Ala Tyr Leu Phe 

,20 25 30 

Ser Gly Phe Phe Gly Ser Leu Glu Met Ala Gin Phe Leu Ala Lys Ser 
35 * 40 45 

Leu Phe Cys Thr Asp Lys Val Gly Val Leu Pro Cys Glu Lys Cys Arg 
50 55 60 

Ser Cys Lys Leu He Glu Gin Glu Glu Phe Pro Asp Val Thr Leu He 
65 70 75 80 

Lys Pro Val Asn Gin Val He Lys Thr Glu Arg He Arg Glu Leu Val 

85 90 95 

Gly Gin Phe Ser Gin Ala Gly lie Glu Ser Gin Gin Gin Val Phe He 

100 105 110 

He Glu Gin Ala Asp Lys Met His Pro Asn Ala Ala Asn Ser Leu Leu 
115 120 125 

Lys Val He Glu Glu Pro Gin Ser Glu Val Tyr He Phe Phe Leu Thr 
130 135 140 

Ser Asp Glu Glu Lys Met Leu Pro Thr He Arg Ser Arg Thr Gin He 
145 150 155 160 

Phe His Phe Lys Lys Gin Glu Glu Lys Leu He Leu Leu Leu Glu Glh 

■ 165 170' 175 

Met Gly Leu Val Lys Lys Lys Ala Thr Leu Leu Ala Lys Phe Ser Gin 

180 185 190 

Ser Arg Ala Glu Ala Glu Lys* Leu Ala Asn Gin Ala Ser Phe Trp Thr 
195 ' 200 205 
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Leu Val Asp Glu Ser Glu Arg Leu Leu Thr Trp Leu Val Ala Lys Lys 
210 215 220 

i ■ 1 i 

5 Lys Glu Ser Tyr Leu Gin Val Ala Lys Leu Ala Asn Leu Ala Asp Asp 

225 230 235' ■ . ' 240, - 



10 



15 



Lys Glu Lys Gin Asp Gin Val Leu Arg lie Leu Glu Val Leu Cys Gly 

245 1 250 255 

i 

1 ' 
Gin Asp Leu Leu Gin Val Arg Val Arg Val lie Leu Gin Asp Leu Leu 



260 



265 



270 



Glu Ala Arg Lys Met Trp Gin Ala Asn Val Ser Phe Gin Asn Ala Met 
275 280 285 



20 



25 



Glu Tyr Leu Val Leu Lys Glu lie 
290 295 



<210> 193 
<211> 204 
<212> PRT 
<213> Streptoco 



<400> 193 

Met Asn Ser Phe Lys Asn Phe Leu Lys Glu Trp Gly Leu Phe Leu Leu 
1 5 10 15 



30 * lie Leu Ser Leu Leu Ala Leu Ser Arg lie Phe Phe Trp Ser Asn Val 

20 25 30 



35 



Arg Val Glu Gly His Ser Met Asp Pro Thr Leu Ala Asp Gly Glu lie 
35 40 ' 45 

Leu Phe Val Val Lys His Leu Pro lie Asp Arg Phe Asp lie Val Val 
50 55 60 



Ala His Glu Glu Asp Gly Asn Lys Asp lie Val Lys Arg Val lie Gly 
40 65 70' 75 80 

Met Pro Gly Asp Thr lie Arg Tyr Glu Asn Asp Lys Leu Tyr lie Asn 

85 90 95 

45 Asp Lys Glu Thr Asp Glu Pro Tyr Leu Ala Asp Tyr lie Lys Arg Phe 

100 105 110 



50 



Lys Asp Asp Lys Leu Gin Ser Thr Tyr Ser Gly Lys Gly Phe Glu Gly 
115 120 125 

Asn Lys Gly Thr Phe Phe Arg Ser lie Ala Gin Lys Ala Gin Ala Phe 
130 135 140 



Thr Val Asp Val Asn Tyr Asn Thr Asn Phe Ser Phe Thr Val Pro Glu 
55 145 150 155 160 



Gly Glu Tyr Leu Leu Leu Gly Asp Asp Arg Leu Val Ser Ser Asp Ser 
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1 165 170 •. 175 ' 

Arg His Val Gly Thr Phe Lys Ala Lys Asp lie Thr Gly Glu Ala Lys 

180' ! , .185 ' ' ' ' 190 

Phe Arg Phe Trp Pro lie Thr Arg lie Gly Thr Phe 1 , 

- • 195 , 200 ' 

* 

10 <210> 194 1 1 

<211> 328 ' ' ' 

<212> PRT , 

<213> Streptococcus pneumoniae t, , ( 

t i ' 1 ■ 

! , 1 , 

15 <400> 194 , 

Met Val Val Phe Thr Gly Ser Thr Val Glu Glu Ala He Gin Lys Gly 
1 5 10 . 15 

Leu Lys Glu Leu Asp lie Pro Arg Met Lys Ala His lie Lys Val lie 

20 20 25 30 

■ 

Ser Arg Glu Lys Lys Gly Phe Leu Gly Leu Phe Gly Lys Lys Pro Ala 
35 40 45 

25 Gin Val Asp lie Glu Ala lie Ser Glu Thr Thr Val Val Lys Ala Asn 

50 55 60 



30 



45 



Gin Gin Val Val Lys Gly Val Pro Lys Lys lie Asn Asp Leu Asn Glu 
65 70 75 * 80 

Pro Val Lys Thr Val Ser Glu Glu Thr Val Asp Leu Gly His Val Val 

85 90 95 



Asn Ala lie Lys Lys lie Glu Glu Glu Gly Gin Gly He Ser Asp Glu 

35 ioo 105 no 

Val Lys Ala Glu He Leu Lys His Glu Arg His Ala Ser Thr He Leu 
115 120 125 

40 Glu Glu- Thr Gly His He Glu He Leu Asn Glu Leu Gin He Glu Glu 
130 135 140 



Ala Met Arg Glu Glu Ala Gly Ala Asp Asp Leu Glu Thr Glu Gin Asp 

145 150 155 160 

Gin Thr Glu Asn Gin Asp Leu Lys Glu Met Gly Leu Lys Val Glu Gin 

165 170 175 



Ser Tyr Asp He Ala Gin Val Ala Thr Asp Val Thr Ala Tyr Val Gin 
50 180 185 190 

Ala He Val Asp Asp Met Asp Val Glu Ala Thr Leu Ser Asn Asp Tyr 
195 200 2-05 

55 Asn Arg Arg Ser He Asn Leu Gin He Asp Thr Asn Glu Pro Gly Arg 
210 215 220 
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! 

I 



15 



40 



55 



He He Gly Tyr His Gly Lys Val Leu Lys Ala Leu Gin Leu Leu Ala 
225 230' 235 240 



Gin Asn Tyr Leu Tyr' Asn Arg Tyr Ser Lys Thr Phe Tyr Val Thr lie • 
5 245 ' 250 ' 255- 

Asn Val Asn Asp Tyr Val Glu His Arg Ala Glu Val Leu Gin Thr Tyr 

260 265 270 

10 Ala Gin Lys Leu Ala ■ Asn Arg Val Leu ■ Glu Glu Gly Arg Ser ( His Lys 

275 . 280 ' 1 285 ' 



1 t i ■ 
Thr Asp Prp Met Ser Asn Ser Glu Arg Lys lie lie His Arg lie lie 

290 .295 1 300 

i i' 

i 

Ser Arg Met Asp Gly Val Thr Ser Tyr Ser Glu Gly Asp Glu Pro Asn 
305 310 315 320 



Arg Tyr Val Val Val Asp Thr Glu 
20 325 



<210> 195 
<211> 460 
25 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 195 

Met Ser Asn Phe Ala lie He Leu Ala Ala Gly Lys Gly Thr Arg Met 
30 1 5 10 15 

Lys Ser Asp Leu Pro Lys Val Leu His Lys Val Ala Gly He Ser Met 

20 25 30 

35 Leu Glu His Val Phe Arg Ser Val Gly Ala He Gin Pro Glu Lys Thr 

35 40 45 



Val Thr Val Val Gly His Lys Ala Glu Leu Val Glu Glu Val Leu Ala 
50 55 60 

Gly Gin Thr Glu Phe Val Thr Gin Ser. Glu Gin Leu Gly Thr Gly His 
65 70 75 80 



Ala Val Met Met Thr Glu Pro He Leu Glu Gly Val Ser Gly His Thr 
45 85 90 95 

Leu Val He Ala Gly Asp Thr Pro Leu He Thr Gly Glu Ser Leu Lys 

100 105 110 

50 Asn Leu He Asp Phe His He Asn His Lys Asn Val Ala Thr He Leu 

115 120 125 ■ 



Thr Ala Glu Thr Asp Asn Pro Phe Gly Tyr Gly Arg He Val Arg Asn 
130 135 140 

Asp Asn Ala Glu Val Leu Arg Ser Leu Leu Ser Arg Arg Met Leu Gin 
145 150 155 * i60 
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10 



25 



40 



55 



lie Leu Lys Ser Lys Ser Arg Lys Ser Thr Leu Val Thr Tyr Val Phe 

165 170 175 



■ i 



Asp Asn Glu Arg Leu Phe Glu Ala Leu Lys Asn He Asn Thr Asn Asn 

180 185 / 190 

Ala Gin Gly Glu Tyr Tyr He Thr Asp Val He Gly He Phe Arg Glu 
195 200 205 

Thr Gly Glu Lys Val Gly Ala Tyr Thr Leu Lys Asp Phe Asp Glu Ser 
210 215 j 220 



Leu Gly Val Asn Asp Arg Val Ala Leu Ala Thr Ala Glu Ser Val Met 
15 225 230 235 240 

Arg Arg Arg lie Asn His Lys His Met Val Asn Gly Val Ser Phe Val 

245 250 255 

20 Asn Pro Glu Ala Thr Tyr He Asp He Asp Val Glu He Ala Pro Glu 

260 265 270 



Val Gin He Glu Ala Asn Val He Leu Lys Gly Gin Thr Lys He Gly 
275 280 , 285 

Ala Glu Thr Val Leu Thr Asn Gly Thr Tyr Val Val Asp Ser Thr He 
290 295 300 



Gly Ala Gly Ala Val lie Thr Asn Ser Met He Glu Glu Ser Ser Val 

30 305 310 315 320 

Ala Asp Gly Val Thr Val Gly Pro Tyr Ala His He Arg Pro Asn Ser 

325 330 335 

35 Ser Leu Gly Ala Gin Val His He Gly Asn Phe Val Glu Val Lys Gly 

340 345 350 



Ser Ser He Gly Glu Asn Thr Lys Ala Gly His Leu Thr Tyr He Gly 
355 360 365 

Asn Cys Glu Val Gly Ser Asn Val Asn Phe Gly Ala Gly Thr He Thr 
370 375 380 



Val Asn Tyr Asp Gly Lys Asn Lys Tyr Lys Thr Val He Gly Val Asn 
45 385 390 395 400 

* > 

Val Phe Val Gly Ser Asn Ser Thr lie lie Ala Pro Val Glu Leu Gly 

405 410 415 

50 Asp Asn Ser Leu Val Gly Ala Gly Ser Thr lie Thr Lys Asp Val Pro 

420 425 430 



Ala Asp Ala lie Ala lie Gly Arg Gly Arg Gin lie Asn Lys Asp Glu 
435 440 445 

Tyr Ala Thr Arg Leu Pro His His Pro Lys Asn Gin 

450- 455 460 
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<210> 196 

' <211> 311 ■ ' ' 

5 <212> PRT • , / 

<21*3> Streptococcus pneumoniae . • 

<400> 196 

Met Ser Lys lie Leu Val Phe Gly His Gin Asn Pro Asp Ser Asp Ala 
10 1 5 10 1 15 ' 

lie Gly Ser .Ser Val Ala Phe Ala Tyr Leu Ala Lys Glu Ala Tyr f Gly' 



20 



20 



25 



30 



15 Leu Asp Thr Glu Ala Val Ala Leu Gly Thr Pro Asn Glu Glu Thr Ala 

35 40 45 



i ■ 



Phe Val Leu Asn Tyr Phe Gly Val Glu Ala Pro Arg Val lie Thr Ser 
50 55 60 

A?a Lys Ala Glu Gly Ala Glu* Gin Val lie Leu Thr Asp His Asn Glu 
65 70 ' 75 80 



Phe Glh Gin Ser Val Ser Asp lie Ala Glu Val Glu Val Tyr Gly Val 
25 85 90 95 

Val Asp His His Arg Val Ala Asn Phe Glu Thr Ala Sex Pro Leu Tyr 

100 , 105 110 

30 Met Arg Leu Glu Pro Val Gly Ser Ala Ser- Ser lie Val Tyr Arg Met 

115 120 125 



35 



Phe Lys Glu His Gly Val Ala Val Pro Lys Glu lie Ala Gly Leu Met 
130 135 140 

Leu Ser Gly Leu lie Ser Asp Thr Leu Leu Leu Lys Ser Pro Thr Thr 
145 150 155 160 



His Pro Thr Asp Lys lie lie Ala Pro Glu Leu Ala Glu Leu Ala Gly 

40 165 170 175 

Val Asn Leu Glu Glu Tyr Gly Leu Ala Met Leu Lys Ala Gly Thr Asn 

180 185 190 

45 Leu Ala Ser Lys Ser Ala Glu Glu Leu lie Asp lie Asp Ala Lys Thr 

195 200 205 



50 



Phe Glu Leu Asn Gly Asn Asn Val Arg Val Ala Gin Val Asn Thr Val 
210 215 220 

Asp lie Ala Glu Val Leu Glu Arg Gin Ala Glu lie Glu Ala Ala Met 

225 230 235 240 



Gin Ala Ala Asn Glu Ser Asn Gly Tyr Ser Asp Phe Val Leu Met lie 
55 245 250 255 



Thr Asp lie Val Asn Ser Asn Ser Glu lie Leu Ala Leu Gly Ala Asn 
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i 

260 265 270 

Met Asp Lys Val Glu Ala Ala Phe Asn Phe Lys Leu Glu Asn Asn His 
275 1 ! . ■ 280 ■ , 1 - ' 285 

5 * ' 1 • | 

Ala Phe Leu Ala Gly Ala Val Ser Arg Lys Lys Gin Val Val Pro Gin 
290 295 300 

i 

Leu Thr Glu Ser Phe Asn Thr 

10 305 310 

i 1 

i 1 . i 
. ■ '• . • , ' 

<210> 197 ' , h , 

<211> 225 f ' ' ( 

15 <212> PRT ( ' 

• <213> Streptococcus pneumoniae 



30 



45 



<400> 197 

Met lie Ser Lys Arg Leu Glu Leu Val Ala Ser Phe Val Ser Gin Gly 
20. 1 5 10 , 15 

Ala lie Leu Leu Asp Val Gly Ser Asp His Ala Tyr Leu Pro lie Glu 

20 25 . 30 

25 Leu Val Glu Arg Gly Gin lie Lys Ser Ala He Ala Gly Glu Val Val 

35 40 45 



Glu Gly Pro Tyr Gin Ser Ala Val Lys Asn Val Glu Ala His Gly Leu 
50 5,5 60 

Lys Glu Lys He Gin Val Arg Leu Ala Asn Gly Leu Ala Ala Phe Glu 

65 70 75 80 



Glu Thr Asp Gin Val Ser Val He Thr He Ala Gly Met Gly Gly Arg 
35 85 90 95 

Leu He Ala Arg He Leu Glu Glu Gly Leu Gly Lys Leu Ala Asn Val 

100 105 110 

40 Glu Arg Leu lie Leu Gin Pro Asn Asn Arg Glu Asp Asp Leu Arg He 

115 120 125 



Trp Leu Gin Asp His Gly Phe Gin He Val Ala Glu Ser He Leu Glu 
130 135 140 

Glu Ala Gly Lys Phe Tyr Glu He Leu Val Val Glu Ala Gly Gin Met 
145 150 155 160 



Lys Leu Ser Ala Ser Asp Val Arg Phe Gly Pro Phe Leu Ser Lys Glu 
50 165 170 175 

Val Ser Pro Val Phe Val' Gin Lys Trp Gin Lys Glu Ala Glu Lys Leu 

180 185 190 

55 Glu Phe Ala Leu Gly Gin He Pro Glu Lys Asn Leu Glu Glu Arg Gin 

195 200 205 



142 



25 



40 



45 



WO 01/49721 PCTYUS00/35604 

Val Leu Val AspLys lie Gin Ala lie Lys Glu Val Leu His Yal Ser 
210 215 ' 220 



Lys 
225 



<210> 198 
<211> 161 

10 <212> PRT • , , ' 

<213> Streptococcus pneumoniae , . • ' i 

<400> 198 , ' 1 

Met Asn Leu Asn Asp lie Lys Asp Leu Met 1 Thr Gin Phe Asp Gin Ser 
15 1 ' i 5 10 ,15 

Ser Leu Arg Glu Phe Ser Tyr Lys Asn Gly thr Asp Glu Leu Gin Phe 

20 25 , , 30 

20 Ser Lys Asn Glu Ala Arg Pro Val Pro Glu Val Ala Thr Gin Val Ala 

35 40 4 5, 



Pro Ala Pro Val Leu Ala Thr Pro Ser Pro Val Ala Pro Thr Ser Ala 

50 55 60 

Pro Ala Glu Thr Val Ala Glu Glu Val Pro Ala Pro Ala Glu Ala Ser 

65 70 75 ' 80 



Val Ala Ser Glu Gly Asn Leu Val Glu Ser Pro Leu Val Gly Val Val 
30 85 90 95 

Tyr Leu Ala Ala Gly Pro Asp Lys Pro Ala Phe Val Thr Val Gly Asp 

100 105 110 

35 Ser Val Lys Lys ' Gly Gin Thr Leu Val He He Glu Ala Met Lys Val 

115 120 125 



Met Asn Glu He Pro Ala Pro Lys Asp Gly Val Val Thr Glu lie Leu 
130 135 140 

Val Ser Asn Glu Glu Met Val Glu Phe Gly Lys Gly Leu Val Arg He 
145 150 155* - 160 

Lys 



<210> 199 
<211> 411 
50 <212> PRT 

* 

<213> Streptococcus pneumoniae 
<400> 199 

Met Lys Leu Asn Arg Val Val Val Thr Gly Tyr Gly Val Thr Ser Pro 
55 1 5 10 15 

He Gly Asn Thr Pro Glu Glu Phe Trp Asn Ser Leu Ala Thr Gly Lys 
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20 



25 



30 



lie Gly lie Gly Gly lie Thr Lys Phe Asp His Ser Asp Phe Asp Val 

35 40 , ■ ■ 45' ' 

His Asn Ala Ala Glu lie Gin Asp Phe Pro Phe Asp 1 Lys Tyr Phe Val 
50 55 60 

Lys Lys Asp Thr Asn Arg Phe Asp Asn Tyr Ser Leu Tyr Ala Leu Tyr 
65 70 , 75 .80 



Ala Ala Gin Glu Ala Val Asn His Ala Asn Leu Asp Val Glu Ala Leu 



85 



90 



95' 



Ash Arg Asp Arg Phe Gly Val lie Val Ala Ser Gly lie Gly Gly lie 



100 



105 



110 



l 



i • > 



Lys Glu lie Glu Asp Gin Val Leu Arg Leu His Glu Lys Gly Pro Lys 
115 120 125 



Arg Val Lys Pro Met Thr Leu Pro Lys Ala Leu Pro Asn Met Ala Ser 
130 135 140 

Gly Asn Val Ala Met Arg Phe Gly Ala Asn Gly Val Cys Lys Ser He 
145 150 155 160 

Asn Thr Ala Cys Ser Ser Ser Asn Asp Ala He Gly Asp Ala Phe Arg 

165 170 175 

Ser lie Lys Phe Gly Phe Gin Asp Val Met Leu Val Gly Gly Thr Glu 

180 185 190 

Ala Ser He Thr Pro Phe Ala He Ala Gly Phe Gin Ala Leu Thr Ala 
195 200 205 

Leu Ser Thr Thr Glu Asp Pro Thr Arg Ala Ser He Pro Phe Asp Lys 
210 215 220 

Asp Arg Asn Gly Phe Val Met Gly Glu Gly Ser Gly Met Leu Val Leu 
225 230 235 240 

Glu Ser Leu Glu His Ala Glu Lys Arg Gly Ala Thr lie Leu Ala Glu 

245 250 255 

Val Val Gly Tyr Gly Asn Thr Cys Asp Ala Tyr His Met Thr Ser Pro 

260 265 270 

His Pro Glu Gly Gin Gly Ala He Lys Ala He Lys Leu Ala Leu Glu 
275 280 285 

Glu Ala Glu He Ser Pro Glu Gin Val Ala Tyr Val Asn Ala His Gly 
290 295 300 



Thr Ser Thr Pro Ala Asn Glu Lys Gly Glu Ser Gly Ala He Val Ala 
305 310 315 320 

Val Leu Gly Lys Glu Val Pro Val Ser Ser Thr Lys Ser Phe Thr Gly 
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325 t , 330 335 

His Leu Leu Gly, Ala Ala Gly Ala Val Glu Ala lie Val Thr lie Glu 

340 ' i 345 ' 350 . 

5 ' 

Ala Met Arg His Asn Phe Val Pro Met Thr Ala Gly Thr Ser Glu Val ' 
355 • 360 365 

Ser Asp Tyr lie Glu Ala Asn Val Val Tyr Gly Gln< Gly Leu Glu Lys 

10 370 , 375 , 380 • 

i' i 

i • ; • • • 

Glu lie Pro Tyr Ala lie Ser Asn Thr Phe Gly Phe Gly Gly His Asn 
385 , , 390 , ' 395 400 

15 Ala Val lieu Ala Phe Lys Arg Trp Glu Asn Arg 

405 ' 410 



<210> 200 
20 <211> 359 
<212> PRT 

i ' 

<213> Streptococcus pneumoniae 
<400> 200 

25 Met Asn lie Tyr Asp Gin Leu Gin Val Val Glu Asp Arg Tyr Glu Glu 
1 5 10 15 

Leu Gly Glu Leu Leu Ser Asp Pro Asp Val Val Ser Asp Thr Lys Arg 

20 25 30 

Phe Met Glu Leu Ser Lys Glu Glu Ala Ser Asn Arg Asp Thr Val lie 
. 35 40 45 



30 



Ala Tyr Arg Glu Tyr Lys Gin Val Leu Gin Asn lie Val Asp Ala Glu 
35 50 55 * 60 

Glu Met lie Lys Glu Ser Gly Gly Asp Ala Asp Leu Glu Glu Leu Ala 
65 70 75 80 

40 Lys Gin Glu Leu Lys Asp Ala Lys Ala Glu Lys Glu Glu Tyr Glu Glu 

85 90 95 



45 



Lys Leu Lys lie Leu Leu Leu Pro Lys Asp Pro Asn Asp Asp Lys Asn 

100 105 110 

lie lie Leu Glu lie Arg Gly Ala Ala Gly Gly Asp Glu Ala Ala Leu 
115 120 125 



Phe Ala Gly Asp Leu Leu Thr Met Tyr Gin Lys Tyr Ala Glu Ala Gin 
50 130 135 140 

Gly Trp Arg Phe Glu Val Met Glu Ala Ser Met Asn Gly Val Gly Gly 
145 150 155 160 

55 Phe Lys Glu Val Val Ala Met Val Ser Gly Gin Ser Val Tyr Ser Lys 

165 170 175 



145 
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30 
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! 

Leu Lys Tyr Glu Ser Gly Ala His Arg Val Gin Arg Val Pro. Val Thr 

180 185 190 



Glu Ser Gin Gly Arg Val His Thr Ser Thr Ala Thr Val , Leu Val Met 

5 ! . 195 200 205 

Pro Glu Val Glu Glu Val Glu Tyr Asp lie Asp Pro Lys Asp Leu Arg 

210 215 220 

10 Val Asp lie Tyr His Ala Ser Gly Ala Gly Gly Gin Asn Val Asn Lys 

225 230 235 ■ ' 240 



Val Ala Thr Ala Val Arg lie Val His Leu Pro Thr Asn lie Lys, Val 

245 I 250 255 



i 

Glu Met Gin Glu Glu Arg Thr Gin Gin Lys Asn Arg Glu Lys Ala Met 

260 : 265 270 i 



Lys lie lie Arg Ala Arg Val Ala Asp His Phe Ala Gin lie Ala Gin 
20 275 280 285 

Asp Glu Gin Asp Ala Glu Arg Lys Ser Thr lie Gly Thr Gly Asp Arg 
290 295. 300 

i 

25 Ser Glu Arg lie Arg Thr Tyr Asn Phe Pro Gin Asn Arg Val Thr Asp 
305 310 315 320 



His Arg lie Gly Leu Thr Leu Gin Lys Leu Asp Thr lie Leu Ser Gly 

325 330 ■ 335 

Lys Leu Asp Glu Val Val Asp Ala Leu Val Leu Tyr Asp Gin Thr Gin 

340 345 350 



Lys Leu Glu Glu Leu Asn Lys 
35 355 



<210> 201 
<211> 559 
40 - <212>. PRT 

<213> .Streptococcus pneumoniae 

<400> 201 

Met Ala Tyr Thr Leu Lys Pro Glu Glu Val Gly Val Phe Ala lie Gly 
45 1 5 10 15 

Gly Leu Gly Glu He Gly Lys Asn Thr Tyr Gly He Glu Tyr Gin Asp 

20 25 30 

50 Glu He He He Val Asp Ala Gly He Lys Phe Pro Glu Asp Asp Leu 

35 40 45 



Leu Gly He Asp Tyr Val He Pro Asp Tyr Ser Tyr He Val Asp Asn 
50 55 60 

He Asp Arg Val Lys Ala Val Leu He Thr His Gly His Glu Asp His 
65 70 75 80 



i ■ > 
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10 



25 



40 



55 



He Gly Gly He Pro Phe Leu Leu Lys Gin Ala Asn Val Pro He Tyr 

85 90 95 

Ala Gly Pro Leu Ala Leu Ala Leu lie Arg Gly Lys Leu Glu Glu His, 

100 105 1 110 

■ i 

Gly Leu Leu Arg Asn Ala Lys Leu Tyr Glu He Asn His Asn Thr Glu 
115 , 120 125 

Leu Thr Phe Lys Asn Leu Lys Ala Thr Phe Phe Arg Thr Thr His Ser 
130 135 i . 140 



He Pro Glu Pro" Leu Gly He Val He His Thr Pro Gin Gly Lys He 

15 1,4 5 r 150 ; 155 160 

Val Cys Thr Gly Asp Phe Lys Phe Asp Phe Thr Pro Val Gly Glu Pro 1 

165 170 175 

20 Ala Asp Leu His Arg Met Ala Ala Leu Gly Glu Glu Gly Val Leu Cys 

180 185 190 



i ' 



Leu Leu Ser Asp Ser Thr Asn Ala Glu Val Pro Thr Phe Thr Asn Ser 

135 200 _ ,205 

Glu Lys Val Val Gly Gin Ser He Met Lys He He Gin Gly He Glu 

210 215 * 220 



Gly Arg He He Phe Ala Ser Phe Ala Ser Asn He Phe Arg Leu Gin 
30 225 230 235 240 

Gin Ala Thr Glu Ala Ala Val Lys Thr Gly Arg Lys He Ala Val Phe 

245 250 255 

35 Gly Arg Ser Met Glu Lys Ala He Val Asn Gly He Asp Leu Gly Tyr 

260 265 " 270 



He Lys Ala Pro Lys Gly Thr Phe He Glu Pro Asn Glu He Lys Asp 

275 280 285 

Tyr Pro Ala Gly Glu Val Leu He Leu Cys Thr Gly Ser Gin Gly Glu 

290 295 300 



Pro Met Ala Ala Leu Ser Arg He Ala Asn Gly Thr His Arg Gin Val 

45 305 310 315 320 

Gin Leu Gin Pro Gly Asp Thr Val lie Phe Ser Ser Ser Pro He Pro 

325 330 335 

50 Gly Asn Thr Thr Ser Val Asn Lys Leu He Asn He He Ser Glu Ala 

340 345 350 



Gly Val Glu Val He His Gly Lys Val Asn Asn lie His Thr Ser Gly 
355 360 ' 365 

His Gly Gly Gin Gin Glu Gin Lys Leu Met Leu Cys Leu He Lys Pro 
370 375 380 



i 



147 



NSDOC1D: <WO 01 49721 A2J_> 



WO 01/49721 PCT/US00/35604 



10 



25 



35 



40 



50 



* 

Lys Tyr Phe Met Pro Val His Gly Glu Tyr Arg Met Gin Lys Val His 
385 390' 395 400 



< i 



Ala Gly Leu Ala Val' Asp Thr Gly Val Glu Lys Asp Asn lie Phe lie 

405 410 ' 415 

: i .i 

t ' 

Met Ser Asn Gly Asp Val Leu Ala Leu Thr Ala Asp Ser Ala Arg lie 

420 425 430 

i 

Ala Gly His Phe Asn Ala Gin Asp lie' Tyr Val Asp Gly Asn. Arg lie 
435 440 445 



Gly Glu lie Gly Ala Ala Val Leu Lys Asp Arg Arg Asp Leu Ser Glu 
15 450 , 455 460 

i 

Asp Gly Val Val Leu Ala Val Ala Thr Val Asp f Phe Lys Ser Gin Met 
465 470 475 480 

20 lie Leu Ser Gly Pro Asp lie Leu Ser Arg Gly Phe Val Tyr Met Arg" 

485 490 ' 495. 



■ i ! 
Glu Ser Gly Asp Leu lie Arg Gin Ser Gin Arg He Leu Phe Asn Ala 

500 505 510 

lie Arg He Ala Leu Lys Asn Lys Asp Ala Ser Val Gin Ser Val Asn 
515 520 525 



Gly Ala He Val Asn Ala He Arg Pro Phe Leu -Tyr Glu Asn Thr Glu 
30 530 535 540 



Arg Glu Pro He He He Pro Met He Leu Thr Pro Asp Glu Glu 
545 550 555 



<210> 202 
<211> 450 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 202 

Met Ala Glu Val Glu Glu Leu Arg Val Gin Pro Gin Asp He Leu Ala 
15 10 15 

45 Glu Gin Ser Val Leu Gly Ala He Phe He Asp Glu Ser Lys Leu Val 

20 25 30 



Phe Val Arg Glu Tyr lie Glu Ser Arg Asp Phe Phe Lys Tyr Ala His 
35 40 45 

Arg Leu He Phe Gin Ala Met Val Asp Leu Ser Asp Arg Gly Asp Ala 
50 55 60 



He Asp Ala Thr Thr Val Arg Thr He Leu Asp Asn Gin Gly Asp Leu 
55 65 70 75 80 

Gin Asn lie Gly Gly Leu Ser Tyr Leu Val Glu He Val Asn Ser Val 
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85 .90 95 

ii' * 
i 

Pro Thr Ser Ala Asn Ala Glu Tyr Tyr Ala Lys lie Val Ala Glu Lys 

100' 1 , 105 ' ' 1 110 

5 ' ■ • ' , , ; 

Ala Met Leu Arg Arg Leu lie Ala Lys Leu Thr Glu. Ser Val Ash ,Gln 
115 , 120 ' 125 

Ala Tyr Glu Ala Ser Gin Pro Ala Asp Glu lie lie, Ala Gin Ala Glu 
10 130 ' 135 ' 140 

Lys Gly Leu lie Asp Val Ser Glu Asn Ala Asn Arg Ser Gly Phe Lys 
145 150 'I 155 160 



20 



35 



50 



15 Asn lie Arg Asp Val Leu Asn Leu Asn Phe Gly Asn Leu Glu Ala Arg 

165 170 . ' • 175 



Ser Gin Gin Thr Thr Asp lie Thr Glly lie Ala Thr Gly Tyr Arg Asp 

180 185 ' 190 

* 

Leu Asp His Met Thr Thr Gly Leu His Glu Glu Glu Leu lie lie Leu 
19'5 200 205 

Ala Ala Arg Pro Ala Val Gly Lys Thr Ala Phe Ala Leu Asn lie Ala 
25 210 215 220 

Gin Asn lie Gly Thr Lys Leu Asp Lys Thr Val Ala lie Phe Ser Leu 
225 230 235 240 

i 

30 Glu Met Gly Ala Glu Ser Leu Val Asp Arg Met Leu Ala Ala Glu Gly 

245 ' 250 255 



Leu Val Glu Ser His Ser lie Arg Thr Gly Gin Leu Thr Asp Glu Glu 

260 265 270 

Trp Gin Lys Tyr Thr lie Ala Gin Gly Asn Leu Ala Asn Ala Ser lie 
275 280 285 



Tyr lie Asp Asp Thr Pro Gly lie Arg lie Thr Glu lie Arg Ser Arg 

40 290 295 300 

Ser Arg Lys Leu Ala Gin Glu Thr Gly Asn Leu Gly Leu lie Val lie 

305 310 315 320 

45 Asp Tyr Leu Gin Leu lie Thr Gly Thr Gly Arg Glu Asn Arg Gin Gin 

325 330 335 



Glu Val Ser Glu lie Ser Arg Gin Leu Lys lie Leu Ala Lys Glu Leu 

'340 345 350 

Lys Val Pro Val lie Ala Leu Ser Gin Leu Ser Arg Gly Val Glu Gin 
355 360 365 



Arg Gin Asp Lys Arg Pro Val Leu Ser Asp lie Arg Glu Ser Gly Ser 
55 370 375 380 

lie Glu Gin Asp Ala Asp lie Val Ala Phe Leu Tyr Arg Asp Asp Tyr 
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385 



390 



395 



400 



Tyr Glu Arg Gly Gly Glu Glu Glu Glu Gly lie Pro Asn Asn Lys Val 

405 410 , 415 

5' ' \ 

Glu Val lie lie Glu Lys Asn Arg Ser Gly Ala Arg,Gly*Thr Val, Glu 

420 425 1 430 

* Leu lie Val Gin Lys Glu Tyr Asn Lys Phe Ser Ser lie Ser Lys Arg 

10 435 440 1 445 
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20 



25 



30 



Glu Ala 
. 450 



<210> 203 

<211> 699 t 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 203 

Met Ala Thr Ala Thr Lys Lys Lys Lys Ser Thr Val Lys Lys Asn Leu 
1 5 . 10 15 

i 

Val lie Val Glu Ser Pro Ala Lys Ala Lys Thr lie Glu Lys Tyr Leu 

20 25 30 

Gly Arg Asn Tyr Lys Val Leu Ala Ser Val Gly His lie Arg Asp Leu 
35 40 45 

Lys Lys Ser Ser Met Ser Val Asp lie Glu Asn Asn Tyr Glu Pro Gin 
50 55 60 



Tyr lie Asn lie Arg Gly Lys Gly Pro Leu lie Asn Asp Leu Lys Lys 
35 65 70 75 80 

Glu Ala Lys Lys Ala Asn Lys Val Phe Leu Ala Ser Asp^Pro Asp Arg 

85 90 95 

40 Glu Gly Glu Ala lie Ser Trp His Leu Ala His lie Leu Asn Leu Asp 

100 105 ' 110 



45 



Glu Asn Asp Ala Asn Arg Val Val Phe Asn Glu lie Thr Lys Asp Ala 
115 120 ' 125 

Val Lys Asn Ala Phe Lys Glu Pro Arg Lys lie Asp Met Asp Leu Val 
130 135 140 



Asp Ala Gin Gin Ala Arg Arg lie Leu Asp Arg Leu Val Gly Tyr Ser 
50 145 150 155 160 

lie Ser Pro lie Leu Trp Lys Lys Val Lys Lys Gly Leu Ser Ala Gly 

165 170 175 

55 Arg Val Gin Ser lie Ala Leu Lys Leu lie lie Asp Arg Glu Asn Glu 

180 185 190 
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lie Asn Ala Phe Gin Pro Glu Glu Tyr Trp Thr Val Asp Ala Val Phe 
195 200 ' 205 



Lys Lys Gly Thr, Lys Gin Phe His Ala Ser Phe Tyr Gly Val Asp Gly 
210 215 220 

i ■ 

Lys Lys Met Lys Leu Thr. Ser Asn Asn Glu Val Lys Glu Val Leu Ser 
225 230 235 240 



10 



15 



20 



Arg Leu Thr Ser Lys Asp Phe Ser Val Asp Gin Val Asp Lys Lys Glu 

245 250 255 



Arg Lys Arg Asn Ala Pro Leu Pro Tyr Thr Thr Ser Ser Met Gin 'Met 

260 j 265 270 

j ! ' 

Asp Ala Ala Asn Lys lie Asn Phe Arg Thr Arg Lys Thr Met Met Val, 



275 



280 



285 



Ala Gin Gin Leu Tyr Glu Gly lie Asn lie Gly Ser Gly Val Gin Gly 
290 295 300 



Leu He Thr Tyr Met Arg Thr Asp Ser Thr Arg He Ser Pro Val Ala 

305 310 315 320 

25 Gin Asn Glu Ala Ala Ser Phe He Thr Asp Arg Phe Gly Ser Lys Tyr 

325 330 335 



30 



Ser Lys His Gly Ser Lys Val Lys Asn Ala Ser Gly Ala Gin Asp Ala 

340 345 350 

His Glu Ala He Arg Pro Ser Ser Val Phe Asn Thr Pro Glu Ser He 

355 360 365 



Ala Lys Tyr Leu Asp Lys Asp Gin Leu Lys Leu Tyr Thr Leu He Trp 
35 370 375 380 



40 



Asn Arg Phe Val Ala Ser Gin Met Thr Ala Ala Val Phe Asp Thr Met 
385 390 , 395 400 

Ala Val Lys Leu Ser Gin Lys Gly Val Gin Phe Ala Ala Asn Gly Ser 

405 410 415 



45 



Gin Val Lys Phe Asp Gly Tyr Leu Ala He Tyr Asn Asp Ser Asp Lys 

420 425 430 

Asn Lys Met Leu Pro Asp Met Val Val Gly Asp Val Val Lys Gin Val 
435 440 445 



Asn Ser Lys Pro Glu Gin His Phe Thr Gin Pro Pro Ala Arg Tyr Ser 
50 450 455 460 



Glu Ala Thr Leu He Lys Thr Leu Glu Glu Asn Gly Val Gly Arg Pro 

465 470 475 1 480 

55 Ser Thr Tyr Ala Pro Thr He Glu Thr He Gin Lys Arg Tyr Tyr Val 

485 490 495 
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Arg Leu Ala Ala Lys Arg Phe Glu Pro Thr Glu Leu Gly Glu lie Val 

500 ' , ' 505 510 ' 

sn Lys Leu He Vai Glu Tyr Phe Pro Asp lie Val 'Asn Val Thr Phe 
515 ' 520 i . 525 ' . • ' 



' i 



Thr Ala Glu Met Glu Gly Lys Leu Asp Asp Vai Glu Val Gly Lys Glu 
530 535 540 



Gin Trp Arg Arg Val lie Asp Ala Phe Tyr Lys Pro Phe Ser 1 ,Lys Glu 
545 550 ,' 555 560 

• i • , 

Val Ala Lys Ala Glu Glu Glu Met Glu Lys lie Gin He Lys Asp Glu 

5,65 570 575 

Pro Ala Gly Phe Asp Cys Glu Val Cys Gly 'Ser Pro Met Val He Lys 

580 585 590 

Leu Gly Arg Phe Gly Lys Phe Tyr Ala Cys Ser Asn Phe Pro Asp Cys 

595 600 605 

i 

Arg His Thr Gin Ala He Val Lys Glu He Gly Val Glu Cys Pro Ser 
610 ,615 620 

Cys His Gin Gly Gin He He Glu Arg Lys Thr Lys Arg Asn Arg Leu 
625 630 635 640 

Phe Tyr Gly Cys Asn Arg Tyr Pro Glu Cys Glu Phe Thr Ser Trp Asp 

645 650 655 

Lys Pro Val Gly Arg Asp Cys. Pro Lys Cys Gly Asn Phe Leu Met Glu 

660 665 670 

Lys Lys Val Arg Gly Gly Gly Lys Gin Val Val Cys Ser Lys Gly Asp 
675 680 . 685 

Tyr Glu Glu Glu Lys Met Ala Leu Cys Gin Leu 
690 695 



<210> 204 

*<211> 326 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 204 

Met Phe He Ser He Ser Ala Gly He Val Thr Phe Leu Leu Thr Leu 
1 5 10 15 

Val Gly He Pro Ala Phe He Gin Phe Tyr Arg Lys Ala Gin .He Thr 

20 25 30 

Gly Gin Gin Met His Glu Asp Val Lys Gin His Gin Ala Lys Ala Gly 
35 40 45 

Thr Pro Thr Met Gly Gly Leu Val Phe Leu He Thr Ser Val Leu Val 
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i 

50 55 60 

Ala Phe Phe Phe Ala Leu Phe Ser Ser Gin Phe Ser Asn Asn Val Gly 

65 : 70 75 80 ! - 



Met lie Leu Phe lie Leu Val Leu Tyr Gly Leu ' Val Gly Phe Leu Asp 

85 90 95 

Asp Phe Leu Lys Val Phe Arg Lys lie Asn Glu Gly Leu Asn Pro Lys- 

10 100 , 105 i 110 

Gin Lys Leu Ala Leu Gin Leu Leii Gly Gly Val lie . Phe Tyr Leu Phe 

115 120 125 

15 Tyr Glu Arg Gly Gly Asp Met Leu Ser Val Phe Gly Tyr Gin Val His 

130 135 140 



Leu Gly lie Phe Tyr lie Val Phe Ala Leu Phe Trp Leu Val Gly Phe 
145 150 155 160 

Ser Asn Ala Val Asn Leu Thr Asp Gly Val Asp Gly Leu Ala Ser lie 

165 1 170 175 



Ser Val Val lie Ser i^eu Ser Ala Tyr Gly Val lie Ala Tyr Val Gin 

25 180 ' 185 190 

Gly Gin Met Asp lie Leu Leu Val lie Leu Ala Met lie Gly Gly Leu 

195 200 205 

30 Leu Ser Phe Phe lie Phe Asn His Lys Pro Ala Lys lie Phe Met Gly 

210 215 * - 220 



Asp Val Gly Ser Leu Ala Leu Gly Gly Met Leu Ala Ala lie Ser Met 
225 230 235 240 

Ala Leu His Gin Glu Trp Thr Leu Leu lie lie Gly lie Val Tyr Val 

245 250 255- 



Phe Glu Thr Thr Ser Val Met Met Gin Val Ser Tyr Phe Lys Leu Thr 
40 260 265 270 

Gly Gly Lys Arg He Phe Arg Met Thr Pro Val His His His Phe Glu- 
275 280 285 

45 Leu Gly Gly Leu Ser Gly Lys Gly Asn Pro Trp Ser Glu Trp Lys Val 
290 295 300 



Asp Phe Phe Phe Trp Gly Val Gly Leu Leu Ala Ser Leu Leu Thr Leu 
305 310 315 320 

Ala He Leu Tyr Leu Met 

325 



55 <210> 205 
<211> 693 
<212> PRT 



i 



i 
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<213> Streptococcus pneumoniae 
<400> 205 ' 

Met Ala Arg Glu Phe Ser Leu Glu Lys Thr 1 Arg Asn. lie Gly lie Met 
1 5 ' . ,10 15 

Ala His Val Asp Ala Gly. Lys Thr THr Thr Thr Glu Arg' lie Leu Tyr 

20 1 25 .30 

Tyr Thr Gly Lys lie His Lys lie Gly Glu Thr His Glu . Gly' Ala Ser 
35 ' 40 ' . , 45 . , 

Gin Met Asp Trp Met Glu Gin Glu <31n Glu Arg Gly. lie Thr lie Thr 

<50 t - ■ 55 . ' ' , 60 

Ser Ala Ala Thr Thr Ala' Gin Trp Asn Asn His Arg Val Asn lie lie 
65 70 ,75 80 

Asp Thr Pro Gly His Val Asp Phe Thr lie Glu Val Gin Arg Ser Leu 

85 90 95 

* 

Arg Val Leu Asp Gly Ala Val Thr Val Leu Asp Ser Gin 1 Ser Gly Val 

100 105 110 

* * 

Glu Pro Gin Thr Glu Thr Val Trp Arg Gin Ala Thr ' Glu Tyr Gly Val 
115 ' 120 125 

* r 

Pro Arg lie Val Phe Ala Asn Lys Met Asp Lys lie Gly Ala Asp Phe 
130 135 140 

Leu Tyr Ser Val Ser Thr Leu His Asp Arg Leu Gin Ala Asn Ala His 
145 . 150 155 160 

Pro lie Gin Leu Pro lie Gly Ser Glu Asp Asp Phe Arg Gly lie lie 

165 170 175 

Asp Leu lie Lys Met Lys Ala Glu lie Tyr Thr Asn Asp Leu Gly Thr 

180 " 185 190 

Asp lie Leu Glu Glu Asp lie Pro Ala Glu Tyr Leu Asp Gin Ala Gin 
195 200 205 

Glu Tyr Arg Glu Lys Leu lie Glu Ala Val Ala Glu Thr Asp Glu Glu 
210 215 220 

Leu Met Met Lys Tyr Leu Glu Gly Glu Glu lie Thr Asn Glu Glu Leu 
225 230 235 240 

Lys Ala Gly lie Arg Lys Ala Thr lie Asn Val Glu Phe Phe Pro Val 

245 250 255 

Leu Cys Gly Ser Ala Phe Lys Asn Lys Gly Val Gin Leu Met Leu Asp 

260 265 270 

Ala Val lie Asp Tyr Leu Pro Ser Pro Leu Asp lie Pro Ala lie Lys 
275 280 285 
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Gly lie Asn Pro Asp Thr Asp Ala Glu Glu lie Arg Pro Ala Ser Asp 

290 295 ' 300 

Glu Glu Pro Phe Al'a Ala Leu Ala Phe Lys Ile Met 1 Thr Asp Pro Phe 
5 305 < 310 ' - 315 , ' 320 

Val Gly Arg Leu Thr ( Phe Phe Arg Val Tyr Ser' Gly Val Leu Gin Ser 

325 330 335 

10 Gly Ser Tyr Val Leu Asn Thr Ser Lys Gly Lys.' Arg Glu Arg lie Gly 

340 ' 345 ' ... 350 . 



15 



30 



45 



* i 

Arg lie Leu Gin Met His Ala Asn Ser, Arg Gin Glu lie Asp Thr Val 

355' ' 360 ' , 365 

Tyr Ser Gly Asp lie Ala Ala Ala Val Gly Leu Lys Asp Thr Thr Thr 
370 375 t 380 



Gly Asp Ser Leu Thr Asp Glu Lys Ala Iiys lie lie Leu Glu Ser lie 
20 385 390 395 400 

Asn Val Pro Glu Pro Val He Gin Leu Met Val Glu Pro 1 Lys Ser Lys 

405 410 ' 415 

25 Ala Asp Gin Asp Lys Met Gly He Ala Leu Gin Lys Leu Ala Glu Glu 

420 425 430 



Asp Pro Thr Phe ■ Arg Val Glu Thr Asn Val Glu Thr Gly Glu Thr Val 
435 , 440 445 

lie Ser Gly Met Gly Glu Leu His Leu Asp Val Leu Val Asp Arg Met 
450 "455 460 

Arg Arg Glu Phe Lys Val Glu Ala Asn Val Gly Ala Pro Gin Val Ser 
35 465 470 475 480 

Tyr Arg Glu Thr Phe Arg Ala Ser Thr Gin Ala Arg Gly Phe Phe Lys 

485 490 495 

40 * Arg Gin Ser Gly Gly Lys Gly Gin Phe Gly Asp Val Trp He Glu Phe 

500 505 510 



Thr Pro Asn Glu Glu Gly Lys Gly Phe Glu Phe Glu Asn Ala lie Val 
515 520 525 

Gly Gly Val Val Pro Arg Glu Phe He Pro Ala Val Glu Lys Gly Leu 
530 535 540 



Val Glu Ser Met Ala Asn Gly Val Leu Ala Gly Tyr Pro Met Val Asp 
50 545 550 555 560 

Val Lys Ala Lys Leu Tyr Asp Gly Ser Tyr His Asp Val Asp Ser Ser 

565 570 ' 575 

55 Glu Thr Ala Phe Lys He Ala Ala Ser Leu Ser Leu Lys Glu Ala Ala 

580 585 590 
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Lys Ser Ala Gin Pro Ala lie Leu Glu Pro Met Met Leu Val Thr lie 
595 600 605 

Thr Val Pro Glu Glu Asn Leu Gly Asp Val Met Gly His Val Thr Ala , 

610 615 620 1 

i 

■ i ' 

Arg Arg Gly Arg Val Asp Gly Met Glu Ala His Gly Asn Ser Gin lie' 
625 - 630 635 640 

Val Arg Ala Tyr Val Pro Leu Ala Glu Met Phe Gly Tyr Ala Thr Val 

645 ' 650 655 ' 

Leu Arg Ser Ala Ser Gin Gly Arg Gly Thr Phe Met Met Val Phe Asp 

660 i 665 670 

His Tyr Glu Asp Val Pro Lys Ser Val Gin Glu Glu lie lie Lys Lys 

675 680 685 \ r . 

Asn Lys Gly Glu Asp 
690 



<210> 206 
<211> 408 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 206 

Met Pro Asn Tyr Asn lie Pro Phe Ser Pro Pro Asp lie Thr Glu Ala 
1 5 10 15 

Glu lie Ala Glu Val Ala Asp Thr Leu Arg Ser Gly Trp lie Thr Thr 

20 25 30 

Gly Pro Lys Thr Lys Glu Leu Glu Arg Arg Leu Ser Leu Tyr Thr Gin 
35 40 45 

Thr Pro Lys Thr Val Cys Leu Asn Ser Ala Thr Ala Ala Leu Glu Leu 
50 55 60 

He Leu Arg Val Leu Glu Val Gly Pro Gly Asp Glu Val He Val Pro 
65 70 75 80 

Ala Met Thr Tyr Thr Ala Ser Cys Ser Val He Thr His Val Gly Ala 

85 90 95 

Thr Pro Val Met Val Asp He Gin Ala Asp Thr Phe Glu Met Asp Tyr 

100 105 110 

Asp Leu Leu Glu Gin Ala He Thr Glu Lys Thr Lys Val He He Pro 
115 120 125 

Val Glu Leu Ala Gly He Val Cys Asp Tyr Asp Arg Leu Phe Gin Val 
130 135 140 

Val Glu Lys Lys Arg Asp Phe Phe Thr Ala Ser Ser Lys Trp Gin Lys 
145 150 155 160 
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10 



25 



40 



50 



55 



Ala Phe Asn Arg lie Val lie Val Ser Asp Ser Ala His Ala Leu Gly 

165 170. 175 

Ser Thr Tyr Lys Gly Gin Pro Ser Gly Ser lie Ala Asp Phe Thr Ser, 

180 185, • ■ ' 190 

Phe Ser Phe His Ala Val Lys Asn Phe Thr Thr Ala Glu Gly Gly Ser 

195 , 200 205 

Ala Thr Trp Lys' Ala Asn Pro Val He Asp Asp Glu Glu Met Tyr Lys 



210 . 215 



220 



Glu Phe Gin He Leu Ser Leu His Gly Gin Thr Lys Asp Ala- Leu Ala 
15 225 ■ 230 j 235 240 

Lys Met Gin Leu Gly Ser Trp Glu Tyr Asp He Val Thr Pro Ala Tyr 1 

245 250 255 

20 Lys Cys Asn Met Thr Asp He Met Ala Ser Leu Gly Leu Val Gin Leu 

260 265 270 



Asp Arg Tyr Pro Ser Leu Leu Gin Arg Arg Lys Asp He Val Asp Arg 
275 280 285 

i 

Tyr Asp Ser Gly Phe Ala Gly Ser Arg He His Pro Leu Ala His Lys 
290 295 300 



Thr Glu Thr Val Glu Ser Ser Arg His Leu Tyr He Thr Arg Val Glu 
30 305 310 315 320 

Gly Ala Ser Leu Glu Glu Arg Ser Leu He He Gin Glu Leu Ala Lys 

325 330 335 

35 Ala Gly He Ala Ser Asn Val His Tyr Lys Pro* Leu Pro Leu Leu Thr 

340 345 350 



Ala Tyr Lys Asn Leu Gly Phe Asp Met Thr Asn Tyr Pro Lys Ala Tyr 
355 360 365 

Ala Phe Phe Glu Asn Glu lie Thr Leu Pro Leu His Thr Lys Leu Ser 
370 375 380 



Asp Glu Glu Val Asp Tyr He He Glu Thr Phe Lys Thr Val Ser Glu 
45 385 390 395 400 



Lys Val Leu Thr Leu Ser Lys Lys 

405 



<210> 207 
<211> 325 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 207 

Met Thr Glu Pro Asp Phe Trp Asn Asp Asn He Ala Ala Gin Lys Thr 



i 
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"1,5 10 15 , 

Ser Gin Glu Leu Asn Val Phe Lys Asn Thr. Tyr Asn Thr Phe His Lys 

20 i , 25. . - . ., . 30 

Met Glu Glu Leu Gin Asp Glu Val Glu lie Leu Leu Asp Phe Leu. Ala 
35 ' 40 '45 ' , 

t 

r 

Glu Asp Glu Ser Val His Asp Glu Leu Val Ala Gin Leu Ala Glu Leu 
50 ,55 ,60 

. . ' ... ... . 

Asp Lys lie Met Thr Ser Tyr Glu Met Thr 1 Leu Leu Leu Ser Glu Pro ■ 
65 70 , - 75 80 

, M . , ■ ' 

> I 1 ' 

i 

Tyr Asp His Asn Asn Ala lie Leu Glu lie 'His Pro Gly Ser Gly Gly 

85 90 95 

Thr Glu Ala Gin Asp Trp Gly Asp Met Leu Leii Arg Met Tyr Thr Arg 

100 , 105 < 110 

Tyr Gly Asn Ala Lys Gly Phe Lys Val Glu Val Leu Asp Tyr Gin Ala 
115 120 125i 

4 

Gly Asp Glu Ala Gly lie Lys .Ser Val Thr Leu Ser Phe Glu Gly Pro 
130 135 140 

Asn Ala Tyr Gly Leu Leu Lys Ser Glu Met Gly Val His Arg Leu Val 
145 150 155 160 

Arg lie Ser Pro Phe Asp Ser Ala Lys Arg Arg His Thr Ser Phe Thr 

165 170 175 

Ser Val Glu Val Met Pro Glu Leu Asp Asp Thr lie Glu Val Glu lie 

180 185 190 

Arg Glu Asp Asp lie Lys Met Asp Thr Phe Arg Ser Gly Gly Ala Gly 
195 200 205 

Gly Gin Asn Val Asn Lys Val Ser Thr Gly Val Arg Leu Thr His lie 
210 215 220 

Pro Thr Gly He Val Val Gin Ser Thr Val Asp Arg Thr Gin Tyr Gly 
225 230 235 240 

Asn Arg Asp Arg Ala Met Lys - Met Leu Gin Ala Lys Leu Tyr Gin Met 

245 250 255 

Glu Gin Glu Lys Lys Ala Ala Glu Val Asp Ser Leu Lys Gly Glu Lys 

260 265 270 

Lys Glu He Thr Trp Gly Ser Gin He Arg Ser Tyr Val Phe Thr Pro 
275 280 285 

Tyr Thr Met Val Lys Asp His Arg Thr Ser Phe Glu Val Ala Gin Val 
290 295 300 

Asp Lys Val Met Asp GjLy Asp Leu Asp Gly Phe He Asp Ala Tyr Leu 
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305 310 315 • 320 

it* * 

Lys Trp Arg lie Ser 

325 i ' ' 

5 ' ' ' ' . 

» 1 

<210> 208 
<211> 249 

<212> PRT ■ 
10 <213> Streptococcus pneumoniae , 

- ■ ■ . ' 

<400> 208 

Met I^he Tyr Thr Tyr Leu Arg Gly Leu 1 Val Val Leu Leu Leu Trp Ser 
1 ' ' 5 ' io - IS- 



IS 



30 



45 



He Asn Gly Asn Ala His Tyr His Asn Thr Asp Lys He Pro Asn Gin 

20 25 30 



Asp Glu Asn Tyr He Leu Val Ala Pro His Arg Thr Trp Trp Asp Pro 

20 35 40 45 

Val Tyr Met Ala Phe" Ala Thr Lys Pro Lys Gin Phe He Phe Met Ala 

50 55 60 

25 Lys Lys Glu Leu Phe Thr Asn Arg He Phe Gly Trp Trp He Arg Met 

65 70 75 80 



Cys Gly Ala Phe Pro He Asp Arg Glu Asn Pro Ser Ala Ser Ala He 

85 90 '95 

Lys Tyr Pro He Asn Val Leu Lys Lys Ser Asp Arg Ser Leu He Met 

100 105 110 



Phe Pro Ser Gly Ser Arg His Ser Asn Asp Val Lys Gly Gly Ala Ala 
35 115 120 125 

Leu lie Ala Lys Met Ala Lys Val Arg He Met Pro Val Thr Tyr Thr 
130 135 140 

40 Gly Pro Met Thr Leu Lys Gly Leu He Ser Arg Glu Arg Val Asp Met 
145 150 155 160 



Asn Phe Gly Asn Pro He Asp lie Ser Asp He Lys Lys Met Asn Asp 

,165 170 175- 

Glu Gly lie Glu Thr Val Ala Asn Arg lie Gin Thr Glu Phe Gin Arg 

180 185 190 



Leu Asp Glu Glu Thr Lys Gin Trp His Asn Asp Lys Lys Pro Asn Pro 
50 195 200 205 

Leu Trp Trp Phe He Arg lie Pro Ala Leu He Leu Ala He lie Leu 
210 215 220 

55 Ala lie Leu Thr He He Phe Ser Phe lie Ala Ser Phe lie Trp Asn 
225 230 235 240 
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Pro Asp Lys Lys Arg Glu Glu Leu Ala 

245 



<210> 209 , . 

<211> 1033 * , ' 

<212> PRT 

<213> Streptococcus pneumoniae 

<400> 209 ■ 

Met lie Ala Gin Leu Asp Thr Lys Thr Val Tyr Ser Phe Met Glu Ser 
1 5 10 15 

1 * 1 ' 

Val lie Ser lie ' Glu Lys Tyr Val Arg Ala Ala Lys Glu Tyr Gly Tyr 

20 25 30 



Thr His Leu Ala Met Met Asp lie Asp Asn Leu Tyr Gly Ala Phe Asp! 
35 40 45 

Phe Leu Glu lie Thr Lys Lys Tyr Gly lie. His Pro Leu Leu Gly Leu 
50 55 60 

Glu Met Thr Val Phe Val Asp Asp Gin Gly Val Asn Leu Arg Phe Leu 

65 70 j 75 80 

; \ 

Ala Leu Ser Ser Val Gly Tyr Gin Gin Leu Met Lys Leu Ser Thr Ala 

85 '90 95 

Lys Met Gin Gly Glu Lys Thr Trp Ser Val Leu Ser Gin Tyr Leu Glu 

100 105 110 

Asp lie Ala Val lie Val Pro Tyr Phe Asp Arg Val Glu Ser Leu Glu 

115 120 125 

» 

Leu Gly Cys Asp Tyr Tyr lie Gly Val Tyr Pro Glu Thr Leu Ala Ser 
130 135 140 

Glu Phe His His Pro lie Leu Pro Leu Tyr Arg Val Asn Ala Phe Glu 
145 150 ' 155 160 

Ser Arg Asp Arg Glu Val Leu Gin Val Leu Thr Ala lie Lys Glu Asn 

165 170 175 

Leu Pro Leu Arg Glu Val Pro Leu Arg Ser Arg Glh Asp Val Phe lie 

180 185 190 

Ser Ala Ser Ser Leu Glu Lys Leu Phe Gin Glii Arg Phe Pro Gin Ala 
195 200 205 

Leu Asp Asn Leu Glu Lys Leu He Ser Gly He Ser Tyr Asp Leu Asp 
210 215 220 

Thr Ser Leu Lys Leu Pro Arg Phe Asn Pro Ala Arg Pro Ala Val Glu 
225 230 235 240 



Glu Leu Arg Glu Arg Ala Glu Leu Gly Leu Val Gin Lys Gly Leu Thr 

2-45 • 250 255 
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10 



25 



40 



55 



Ser Lys Glu Tyr Gin Asp' ,Arg Leu Asp Gin Glu Leu Ser Val lie His 

260 265 270 



< i 



Asp Met Gly Phe' Asp Asp Tyr Phe Leu Val Val Trp Asp Leu Leu Arg 
275 i 280 285. ' . * ( 

Phe Gly Arg Ser Asn Gly Tyr Tyr Met Gly Met Giy Arg Gly Ser Ala 
290 295 300 



Val Gly Ser Leu Val Ser Tyr Ala Leu Asp lie, Thr Gly Ile'^Vsp Pro 
305 310 315 320 



Val dlu Lys Asn Leii lie Phe Glu Arg Phe, Leu Asn Arg Glu Arg Tyr 
15 < 3J25 330 335 

Thr Met Pro Asp lie Asp lie Asp lie Pro Asp lie Tyr Arg Pro Asp 

340 345 350 

i 

20 Phe lie Arg Tyr Val Gly Asn Lys Tyr Gly Ser Lys His Ala Ala Gin 

355 360 365 

i * 

He Val Thr Phe Ser Thr Phe Gly Ala Lys Gin Ala Leu Arg Asp Val 



370 . 375 380 



Leu Lys Arg Phe Gly Val Pro Glu Tyr Glu Leu Ser Ala He Thr Lys 
385 * 390 395 400 



Lys He Ser Phe Arg Asp Asn £eu Lys Ser Ala Tyr Glu Gly Asn Leu 
30 405 410 415 

Gin Phe Arg Gin Gin He Asn Ser Lys Leu Glu Tyr Gin Lys Ala Phe 

420 425 430 

35 Glu* He Ala Cys Lys He Glu Gly Tyr Pro Arg Gin Thr Ser Val His 

435 440 445 



Ala Ala Gly Val Val He Ser Asp Gin Asp Leu Thr Asn Tyr He Pro 
450 455 460 

Leu Lys Tyr Gly Asp Glu He Pro Leu Thr Gin Tyr Asp Ala His Gly 
465 470 ' 475 480 



Val Glu Ala Ser Gly Leu Leu Lys Met Asp Phe Leu Gly Leu Arg Asn 

45 485 490 495 

Leu Thr Phe Val Gin Lys Met Gin Glu Leu Leu Ala Glu He Glu Gly 

500 505 510 

50 He His Leu Lys He Glu Glu He Asp Leu Glu Asp Lys Glu Thr Leu 

515 520 525 



Asp Leu Phe Ala Ser Gly Asn Thr Lys Gly He Phe Gin Phe Glu Glh 

530 535 540 

Pro Gly Ala He Arg Leu Leu Lys Arg. Val Gin Pro Val Cys Phe Glu 

545 550 555 560 
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Asp Val Val Ala Thr Thr Ser Leu Asn Arg Pro Gly Ala Ser Asp Tyr 

565 570 575 

lie Asn Asn Phe Val Ala Arg Lys His Gly Gin Glu Glu Val Thr Val 

580 585 590 

Leu Asp Pro Val Leu Glu Asp lie Leu Ala Pro Thr Tyr Gly lie Met 
595 600 605 



Leu Tyr Gin Glu Gin Val Met Gin Val Ala Gin Arg Phe Ala Gly Phe 
610 615 620 

Ser Leu Gly Lys Ala Asp lie Leu Arg Arg Ala Met Gly Lys Lys Asp 
625 630 635 640 



Ala Ser Ala Met His Glu Met Arg Ala Ser Phe lie Gin Gly Ser Leui 

645 650 655 



i ■ 



Glu Ala Gly His Thr Val Glu Lys Ala Glu Gin Val Phe Asp Val Met 

660 665 670 

Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser His Ala Tyr Ala Tyr 
675 ' 680 685 

Ser Ala Leu Ala Phe Gin Leu Ala Tyr Phe Lys Thr His Tyr Pro Ala 
690 ' 695 700 

lie Phe Tyr Gin lie Met Leu Asn Ser Ala Asn Ser Asp Tyr Leu lie 
705 710 715 720 

Asp Ala Leu Glu Ala Gly Phe Glu Val Ala Pro Leu Ser lie Asn Thr 

725 730 735 

lie Pro Tyr His- Asp Lys lie Ala Asn Lys Ala lie Tyr Leu Gly Leu 

740 745 750 

Lys Ser lie Lys Gly Val Ser Asn Asp. Leu Ala Leu Trp lie lie Glu 
755 760 * 765 

His Arg Pro Tyr Ser Asn lie Glu Asp Phe lie Ala Lys Leu Pro Glu 
770 775 780 

Asn Tyr Leu Lys Leu Pro Leu Leu Glu Pro Leu Val Lys Val Gly Leu 
785 790 795 800 

Phe Asp Ser Phe Glu Lys Asn Arg Gin Lys Val Phe Asn Asn Leu Ala 

805 810 815 

Asn Lgu Phe Glu Phe Val Lys Glu Leu Gly Ser Leu Phe Gly Asp Ala 

820 825 830 

lie Tyr Ser Trp Gin Glu Ser Glu Asp Trp Thr Glu Gin Glu Lys Phe 
835 840 845 



Tyr Met Glu Gin Glu Leu Leu Gly lie Gly Val Ser Lys His Pro Leu 
850 855 860 
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io 



25 



35 



40 



50 



Gin Ala lie Ala Ser Lys Ala lie Tyr Pro lie Thr Pro lie Gly Asn 
865 870 875 880 

Leu Ser Glu Asn Ser Tyr Ala lie lie Leu Val t Glu Val Gin Lys lie 1 

885 890 ' ' / . 895 

Lys Val lie Arg Thr Lys Lys Gly Glu Asn Met Ala Phe Leu Gin Ala 

900 905 910 

a 

, \ - I 

l 

Asp Asp Ser Lys Lys Lys Leu Asp Val Thr Leu Phe Ser Asp* Leu Tyr 
915 920 925 ' 



Arg Gin Val Gly Gin Glu lie Lys Glu Gly Ala Phe Tyr Tyr Val Lys 
15 930 * 935 i 940 

Gly Lys lie Gin Ser Arg Asp Gly Arg Leu Gin Met lie Ala Gin Glu 
945 950 955 960 

20 He Arg Glu Ala Val Ala Glu Arg Phe Trp lie Gin Val Lys Asn His 

965 ' 970 975 



Glu Ser Asp Gin Glu He Ser Arg: He Leu Glu Gin Phe Lys Gly Pro 

980 !; 985 990 

He Pro Val He He Arg Tyr Glu Glu Glu Gin Lys Thr He Val Ser 
995 1000 1005 



Pro His His Phe Val Ala Lys Ser Asn Glu Leu Glu Glu Lys Leu Asn 
30 1010 1015 1020 



Glu He Val Met Lys Thr He Tyr Arg 
1025 1030 



<210> 210 
<211> 306 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 210 

Met Thr Asn Glu Phe Leu His Phe Glu Lys He Ser Arg Gin Thr Trp 
1 5 10 15 



45 Gin Ser Leu His Arg Lys Thr Thr Pro Pro Leu Thr Glu Glu Glu Leu 

20 25 30 



Glu Ser He Lys Ser Phe Asn Asp Gin He Ser Leu Gin Asp Val Thr 
35 40 45 

Asp He Tyr Leu Pro Leu Ala His Leu He Gin He Tyr Lys Arg. Thr 
50 55 60 



Lys Glu Asp Leu Ala Phe Ser Lys Gly lie Phe Leu Gin Arg Glu Ser 
55 65 70 75 80 

Lys Ser Gin Pro Phe He He Gly Val Ser Gly Ser Val Ala Val Gly 



i 
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i 

85 ,90 95 

I 

Lys Ser Thr Thr Ser Arg Leu Leu Gin lie Leu Leu Ser Arg Thr Phe 

100 ' 1 ( 105 1 1 110 

Thr Asp Ala Thr, Val Glu Leu Val Thr Thr Asp Gly Phe , Leu Tyr' Pro 
115 , 120 125 

Asn Gin Thr Leu lie Glu Gin Gly lie Leu Asn Arg ,Lys Gly- Phe Pro 
130 135 . ( *' 140 

Glu Ser Tyr Asp Met Glu Ala Leu Leu Asn Phe Leu Asp Arg. lie Lys 
145 150 i 155 160 

t ■ 
Asn Gly Gin Asp Val Asp lie Pro Val Tyr Ser His Glu Val Tyr Asp 

165 170 175 

lie Val Pro Lys Lys Lys Gin Ser Val Lys Ala Ala Asp Phe Val lie 

180 185 ' 190 

Val Glu Gly lie Asn Val Phe Gin Asn Pro Gin Asn Asp Arg Leu Tyr 
195 200 205 ! 

lie Thr Asp Phe Phe Asp Phe Ser lie Tyr Val Asp Ala Gly Val Asp 
210 215 220 

Asp lie Glu Ser Trp Tyr Leu Asp Arg Phe Leu Lys Met Leu Ser Leu 
225 230 235 240 

Ala Gin* Asn Asp Pro Asp Ser Tyr Tyr Tyr Arg Phe Thr Gin Met Pro 

245 250 255 

He Gly Glu Val Giu Ser Phe Ala His Gin Val Trp Thr Ser He Asn 

260 265 270 



Leu Thr Asn Leu Gin Asn Tyr He Glu Pro Thr Arg Asn Arg Ala Glu 
• 275 280 285 

Val He Leu His Lys Ser Lys Asn His Glu He Asp Glu He Tyr Leu 
290 295 300 

Lys Lys 
305 



<210> 211 
<211> 246 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 211 

Met Glu He Ser Leu Leu Thr Asp Val Gly Gin Lys Arg Thr Asn Asn 
1 5 10 15 

Gin Asp Tyr Val Asn His Tyr Val Asn Arg Ala Gly Arg Thr Met He 

20 25 30 
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lie Leu Ala Asp Gly Met Gly Gly His Arg Ala Gly Asn lie Ala Ser 
35 • 40 45 

* 

Glu -Met Ala Val Thr ( Asp Leu Gly Val Ala Trp Val Asp Thr Gin lie 
5 50 " , 55 . . ,60 

Asp Thr Val Asn Glu Val Arg Glu Trp Phe Ala His Tyr Leu Glu lie 
65 70 75 80 

10 Glu Asn Gin Lys lie His -Gin Leu Gly Gin Asp Glu' Ala Tyr Arg Gly 

85' ' 90 ■ - . . ■ ■ 95 



15 



30 



55 



Met Gly Thr Thr Leu Glu Val Leu Al^i He lie Asp Asn Gin Ala He 

i 100 105 - , - 110 

Tyr Ala His He Gly Asp Ser Arg He Gly Leu He Arg. Gly Glu Glu 

115 120 . 125 



Tyr His Gin Leu Thr Ser Asp His Ser Leu Val Asn Glu Leu Leu Lys 
20 130 135 140 

Ala Gly Gin Leu Thr Pro .Glu Glu Ala Glu Ala His Prb Gin Lys Asn 
145 150 155 160 

25 He He Thr Gin Ser He Gly Gin Lys Asp Glu He Gin Pro Asp Phe 

165 170 175 



Gly Thr Val He Leu Glu Ser Gly Asp Tyr. Leu Leu Leu Asn Ser Asp 

180 185 190 

Gly Leu Thr Asn Met He Ser. Gly Ser Glu He Arg Asp He Val Thr 
195 200 205 



Ser Asp He Pro Leu Ala Asp Lys Thr Glu Thr Leu Val Arg Phe Ala 
35 210 215 220 

Asn Asn Ala Gly Gly Leu Asp Asn He Thr Val Ala Leu Val Ser Met 
225 230 235 240 

40 Asn Glu Glu Asp Glu Glu 

245 



<210> 212 
45 <211> 276 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 212 

50 Met Thr He Gin Met Lys Asn Thr Gly Lys Arg He Asp Leu He Ala 
1 5 10 15 . 

Asn Arg Lys Pro Gin Ser Gin Arg Val Leu Tyr Glu Leu Arg Asp Arg 

20 25 30 

Leu Lys Arg Asn Gin Phe He Leu Asn Asp Thr Asn Pro Asp He Val 
35 40 45 
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lie Ser lie Gly Gly Asp Gly Met Leu Leu Ser Ala Phe His Lys Tyr 

50 55 60 • . 

Glu Asn Gin Leu Asp Lys Val Arg Phe lie Gly Leu His Thr Gly H,is 

65 70 75, ■ 80 

■ i i 

t 

Leu Gly Phe Tyr Thr Asp Tyr Arg Asp Phe Glu Leu Asp Lys Leu Val 

85 90 95 

Thr Asn Leu Gin Leu Asp Thr Gly Ala Arg Val Ser Tyr Pro Val L'eu 

100 105 . 110 

Asn Val Lys Val Phe Leu Glu Asn Gly Glu Val Lys lie Phe Arg Ala 
115 120 . 125 

! 

Leu Asn Glu Ala Ser lie Arg Arg Ser Asp Arg Thr Met Val Ala Asp] 
130 135 140 

lie Val lie Asn Gly Val Pro Phe Glu Arg Phe Arg Gly Asp Gly Leu 
145 150 . 155 160 

Thr Val Ser Thr Pro Thr Gly Ser Thr Ala Tyr Asn Lys Ser Leu Gly 

165 170 175 

i i 
i 

Gly Ala Val Leu His Pro Thr lie Glu Ala Leu Gin Leu Thr Glu lie 

180 185 190 

Ala Ser Leu Asn Asn Arg Val Tyr Arg Thr Leu Gly Ser Ser lie lie 
195 200 205 

Val Pro Lys Lys Asp Lys lie Glu Leu lie Pro Thr Arg Asn Asp Tyr 
210 215 220 

His Thr lie Ser Val Asp Asn Ser Val Tyr Ser Phe Arg Asn lie Glu 
225 230 235 240 

Arg He Glu Tyr Gin He Asp His His Lys He His Phe Val Ala Thr 

245 250 255 

Pro Ser His Thr Ser Phe Trp Asn Arg Val Lys Asp Ala Phe He Gly 

260 265 270 

Glu Val Asp Glu 
275 



<210> 213 
<211> 540 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 213 

Met Ser Lys Glu He Lys Phe Ser Ser Asp Ala Arg Ser Ala Met Val 
15 10 15 

Arg Gly Val Asp lie Leu Ala Asp Thr Val Lys Val Thr Leu Gly Pro 
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20 



25 



30 



* 

Lys Gly Arg Asn Val Val Leu Glu Lys Ser Phe Gly Ser Pro Leu lie 
35 40 ,45 

5 

Thr Asn Asp Gly Val Thr lie Ala Lys, Glu lie' Glu Leu Glu Asp . His, 
50 55 60, ' 

Phe Glu Asn Met Gly Ala Lys Leu Val Ser Glu Val Ala Ser Lys Thr 
10 65 70 75' ,80 

* Asn Asp lie Ala Gly Asp Gly Thr Thr Thr Ala , Thr Val Leu Thr Gin. 



85 



90 



95 



15 Ala lie Val Arg Glu Gly lie Lys Asn Val Thr Ala Gly Ala Asn Pro 

100 105 110 



20 



He Gly He Arg Arg Gly He Glu Thr Ala Val Ala Ala Ala Val Glu 
115 120 125 

Ala Leu Lys Asn Asn Ala He Pro Val Ala Asn Lys Glu Ala He Ala 
130 135 140 



Gin Val Ala Ala Val Ser Ser Arg Ser Glu Lys Val Gly ulu Tyr He 
25 145 150 1 ' 155 160 

Ser Glu Ala Met Glu Lys Val Gly Lys Asp Gly Val He Thr He Glu 

165 170 . 175 

30 Glu Ser Arg Gly Met Glu Thr Glu Leu Glu Val Val Glu Gly Met Gin 

180 185 - 190 



35 



Phe Asp Arg Gly Tyr Leu Ser Gin Tyr Met Val Thr Asp Ser Glu Lys 

195 ' 200 205 

Met Val" Ala Asp Leu Glu Asn Pro Tyr He Leu He Thr Asp Lys Lys 
210 215 220 



He Ser Asn He Gin Glu He Leu Pro Leu Leu Glu Ser He Leu Gin 
40 225 230 235 240 

Ser Asn Arg Pro Leu Leu He He Ala Asp Asp Val Asp Gly Glu Ala 

245 250 255 

45 Leu Pro Thr Leu Val Leu Asn Lys He Arg Gly Thr. Phe Asn Val Val 

260 265" '270 



50 



Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Glu 
275 280 285 

Asp He Ala He Leu Thr Gly Gly Thr Val He Thr Glu Asp Leu Gly 
290 295 300 



Leu Glu Leu Lys Asp Ala Thr He Glu Ala Leu Gly Gin Ala Ala Arg 
55 305 310 315 320 



Val Thr Val Asp Lys Asp Ser Thr Val He Val Glu Gly Ala Gly Asn 
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i i 

,325 330 ' 335 , 

Pro Glu Ala lie Ser His 'Arg Val Ala Val. lie Lys Ser Gin lie Glu 

340 ■ t 345 . ■ . ., 350 

i ' 1 i 

Thr. Thr Thr Ser Glu Phe Asp Arg Glu Lys Leu Gin Glu Arg Leu -Ala 

355 ' 360 ' ' 365' : 

Lys Leu Ser Gly Gly Val Ala Val lie Lys Val Gly Ala Ala Thr Glu 

370 ,375 . 380 ! 

Thr Glu Leu Lys Glu Met Lys Leu Arg lie 1 Glu Asp Ala Leu Asn Ala * 
385- 390 , . -395 . 400 

Thr Arg Ala Ala Val Glu Glu Gly lie Val' Ala . Gly Gly Gly Thr Ala 

4t)5 410 - . 415 

Leu Ala Asn Val lie Pro Ala Val Ala Thr Leu Glu Leu Thr Gly Asp 

420 425 1 43.0 

* ■ 

1 i 

Glu Ala Thr Gly Arg Asn lie Val Leu Arg Ala Leu Glu Glu Pro Val 
43? 440 445» 

Arg Gin lie Ala His Asn Ala Gly Phe Glu Gly Ser lie Val lie Asp 
450 455 460 

Arg Leu Lys Asn Ala Glu Leu Gly lie Gly Phe Asn Ala Ala Thr Gly 
465 470 475 480 

Glu Trp Val Asn Met lie Asp Gin Gly lie lie Asp Pro Val Lys Val 

485 490 495 

Ser Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala Ser Leu lie Leu 

- 500 505 510 

Thr Thr Glu Ala Val Val Ala Asn Lys Pro Glu Pro Val Ala Pro Ala 
515 520 525 - 

Pro Ala Met Asp Pro Ser Met Met Gly Gly Met Met 
530 535 540 



<210> 214 
<211> 481 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 214 

Met lie Lys lie Glu Thr Val Leu Asp lie Leu Lys Lys Asp Gly Leu 
1 5 10 15 

Phe Arg Glu lie lie Asp Gin Gly- His Tyr His Tyr Asn Tyr Ser Lys 

20 25 30 

Val lie Phe Asp Ser lie Ser Tyr Asp Ser Arg Lys Val Thr Glu Asp 
35 40 45 
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Thr Leu Phe Phe Ala Lys Gly Ala Ala Phe Lys Lys Glu Tyr Leu Leu 
50 55 60 

( Ser Ala lie Thr Gin Gly Leu Ala Trp Tyr Val' Ala Glu Lys Asp Tyf 
5 65 70 75 80, 

Glu Val Gly. lie Pro Val He He Val Asn Asp lie Lys Lys Ala Met 

85 90 95 

j 

10 Ser Leu He Ala Met Glu Phe Tyr Gly Asn Pro' Gin Glu Lys Leu Lys 

100 105 ' 110 



15 



Leu ! Leu Ala Phe Thr Gly Thr Lys Gly Lys Thr Thr Ala Ala Tyr 'Phe 

115 126 125 

1 

i 

Ala Tyr Asn He Leu Ser Gin Gly His Arg Pro Ala Met Leu Ser Thr 
130 135 140 1 



Met Asn Thr Thr Leu Asp Gly Glu Thr Phe Phe Lys Ser Ala Leu Thr 
20 145 150 155 160 



25 



Thr Pro Glu Ser lie Asp Leu Phe Asp Met Met Asn Gin Ala Val Gin 

165 170 175 



Asn Asp Arg Thr His Leu He Met Glu Val Ser Ser Gin Ala Tyr Leu 

180 185 190 



30 



Val Lys Arg Val' Tyr Gly Leu Thr Phe Asp Val Gly Val Phe Leu Asn 
195 200 205 

He Ser Pro Asp His He Gly Pro He Glu His Pro Ser Phe Glu Asp 
210 215 220 



Tyr Phe Tyr His Lys Arg Leu Leu Met Glu Lys Ser Arg Ala Val 'lie 
35 225 230 235 240 

He Asn Ser Asp Met Asp His Phe Ser Val Leu Lys Glu Gin Val Glu 

245 250 255 

40 Asp Gin Asp His Asp Phe Tyr Gly Ser- Gin Phe Asp Asn Gin He Glu 

260 265 270 



45 



Asn Ser Lys Ala Phe • Ser Phe Se.r Ala Thr Gly Lys Leu Ala Gly Asp 
275 280 285 

Tyr Asp -lie Gin Leu He Gly Asn Phe Asn Gin Glu Asn Ala Val Ala 
290 295 300 



Ala Gly Leu Ala Cys Leu Arg Leu Gly Ala Ser Leu Glu Asp He Lys 
50 305 310 3l5 320 



55 



Lys Gly. lie Ala Ala Thr Arg Val Pro Gly Arg Met Glu Val Leu Thr 

325 330 ' 335 . 

Gin Lys Asn Gly Ala Lys Val Phe lie Asp Tyr Ala His Asn Gly Asp 

340 345 350 
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Ser Leu Lys Lys Leu lie Asn Val Val Glu Thr His Gin Thr Gly Lys 
355 ', 360- 365 

lie Ala Leu Val Leu iGly Ser Thr Gly Asn Lys Gly Glu Ser Arg Arg 

370 1 375 ' 38,0 

Lys Asp Phe Gly Leu Leu Leu Asn Gin His Pro' Glu lie Gin Val Phe 
385 390 395 400 

i 

Leu Thr Ala Asp Asp t Pro Asn Tyr Glu ( Asp Pro' Met Ala lie Ala Asp 

405 ' t 410, , " $15 

Glu lie Ser Ser Tyr lie Asn His Proi Val Glu Lys lie Ala Asp Arg 

1 420 ' ■ 425 . 430 

i '* 
Gin Glu Ala lie Lys Ala Ala Met Ala lie Thr* Asn His Glu Leu Asp 
435 440 1 445 

Ala Val lie lie Ala Gly Lys Gly Ala Asp Cys Tyr Gin lie lie Gin 
450 455 • 460 

Gly Lys Lys Glu Ser Tyr Pro Gly Asp Thr Ala Val Ala' Glu Asn Tyr 
465 470 475 480 

i * ■ 

Leu 



<210> 215 
<211> 659 
<212> PRT 

<213> Streptococcus 
<400> 215 

Met lie Gin He Gly 
1 5 

Gin He Gly Arg Gly 

20 

He Leu Asp Gly Glu 
35 

Gin Thr Asp Pro He 
50 

Met Ala Asp Leu Asp 
65 

Glu Glu Asp Gly Gin 

85 

Asp Leu Lys Arg Tyr 

100 

Ala Val Arg He Met 
115 



pneumoniae 



Lys lie Phe Ala Gly Arg 

10 

Gly Met Ala Asp Val Tyr 

25 

» 

Glu Val Ala Val Lys Val 
40 

* 

Ala Val Ala Arg Phe Gin 
55 

His Pro His He Val Arg 
70 75 

Gin Tyr Leu Ala Met Glu 

90 

He Lys Glu His Tyr Pro 

105 

Gly Gin He Leu Leu -Ala 
120 



Tyr Arg He Val Lys 

15 

Leu Ala Lys Asp Leu 
30 

Leu Arg Thr Asn Tyr 
45 ' 

Arg Glu Ala Arg Ala 
60 

He Thr Asp He Gly 

80 

Tyr Val Ala Gly Leu 

95 . 

Leu Ser Asn Glu Glu 
110 

Met Arg Leu Ala His 
125 
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10 



25 



40 



55 



Thr Arg Gly lie Val His ' Arg Asp Leu Lys Pro Gin Asn lie Leu Leu 
130 135 140 

' i . ' -. 

Thr Pro Asp Gly Thr Ala Lys Val Thr Asp Phe Gly lie Ala Val Ala 

145 150 155, ' 160 

• - • i ~ 

t 

Phe Ala Glu Thr Ser Leu Thr Gin Thr Asn Ser Met Leu Gly Ser Val 

165 170 175 



His Tyr Leu Ser Pro 'Glu Gin Ala Arg Gly. Ser ,Lys Ala Thr* Val Gin 

180 185 ' ' 190, ' 



Ser Asp lie Tyr Ala Met Gly lie lie Phe, Tyr Glu Met Leu Thr. Gly 
15 195 , 200 , 205 

i * 

i < * 

His lie Pro Tyr Asp Gly Asp Ser Ala Val Thr lie Ala Leu Gin His 

210 215 220 

i 

i 

20 Phe Gin Lys Pro Leu Pro Ser Val lie Ala Glu Asn Pro Ser Val Pro 
225 230 235 240. 



Gin Ala Leu Glu Asn Val lie lie Lys Ala Thr Ala Lys Lys Leu Thr 



n A C ocn o r- r- 

<,u d.*J\J ^JJ 



Asn Arg Tyr Arg Ser Val Ser Glu Met Tyr Val Asp Leu Ser Ser Ser 

260 265 270 



Leu Ser Tyr Asn Arg Arg Asn Glu Ser Lys Leu lie Phe Asp Glu Thr 
30 275 280 285 

Ser Lys Ala Asp Thr Lys Thr Leu Pro Lys Val Ser Gin Ser Thr Leu 
290 295 300 

35 Thr Ser lie Pro Lys Val Gin- Ala Gin Thr Glu His Lys Ser lie Lys 

305 310 315 320 



Asn Pro Ser Gin Ala. Val Thr Glu Glu Thr Tyr Gin Pro Gin Ala Pro 

325 330 335 

Lys Lys His Arg Phe Lys Met Arg Tyr Leu lie Leu Leu Ala Ser Leu 

340 345 350 



Val Leu Val Ala Ala Ser Leu lie Trp lie Leu Ser Arg Thr Pro Ala 

45 355 • 360 365 

9 

Thr lie Ala lie Pro Asp Val Ala Gly Gin Thr Val Ala Glu Ala Lys 

370 ' 375 380 

50 Ala Thr Leu Lys Lys Ala .Asn Phe Glu lie Gly Glu Glu Lys Thr Glu 

385 390 395 400 



Ala Ser Glu Lys Val Glu Glu -Gly Arg lie lie Arg Thr Asp Pro Gly 

405 410 415 

Ala Gly Thr Gly Arg Lys Glu Gly Thr Lys lie Asn Leu Val Val Ser 

420 425 * 430 
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Ser Gly Lys Gin Ser Phe Gin lie Ser Asn Tyr Val Gly Arg Lys Ser 
435 440 445 

Ser Asp Val lie Ala Glu Leu Lys Glu Lys Lys Val Pro Asp Asn Leu, 
450 455 1 460 ■ 

i 

lie Lys lie Glu Glu Glu Glu Ser Asn Glu Ser Glu Ala Gly Thr Val 
465 470 475 480 

Leu Lys Gin Ser Leu Pro Glu Gly Thr Thr Tyr Asp Leu Ser Lys Ala 

485 i 490 . . 495 

I • ( : 

Thr Gin lie Val Leu Thr Val Ala Lys Lys Ala Thr Thr lie Gin Leu 

500 ; 505 510 

i 

Gly Asn Tyr lie Gly Arg Asn Ser Thr Glu Val lie Ser Glu Leu Lys 1 
515 520 525 

Gin Lys Lys Val Pro Glu Asn Leu lie Lys lie Glu Glu Glu Glu Ser 
530 535 540 

* 

Ser Glu Ser Glu Pro Gly Thr lie Met Lys Gin Ser Pro Gly Ala Gly 
545 550 555 560 

Thr Thr Tyr Asp Val Ser Lys Pro Thr Gin lie Val Leu Thr Val Ala 

565 570 575 

Lys Lys Val Thr Ser Val Ala Met Pro Ser Tyr lie Gly Ser Ser Leu 

580 585 590 . * 

Glu Phe Thr Lys Asn Asn Leu He Gin He Val Gly He Lys Glu Ala 
595 600 605 

Asn He Glu Val Val Glu Val Thr Thr Ala Pro Ala Gly Ser Ala Glu 
610 615 620 

Gly Met Val Val Glu Gin Ser Pro Arg Ala Gly Glu Lys Val Asp Leu 
625 630 635 640 

Asn Lys Thr Arg Val Lys He Ser He Tyr Lys Pro Lys Thr Thr Ser 

645 650 655 

Ala Thr Pro 



i ■ 



<210> 216 
<211> 391 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 216 

Met Lys His Phe Asp Thr He Val He Gly Gly Gly Pro Ala Gly Met 
15 10 15 

Met Ala Thr He Ser Ser Asn Phe Tyr Gly Gin Lys Thr Leu Leu He 
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20 



25 



30 



Glu Lys Asn Arg Lys Leu Gly Lys Lys Leu Ala Gly Thr Gly Gly Gly 

35 40 ' . . "45 



Arg Cys Asn Val Thr Asn Asn Gly Ser, Leu Asp 'Asn Leu Leu Ala Gly 
50 55 60.' 



lie Pro Gly Asn Gly Arg Phe Leu Tyr Ser Val Phe Ser Gin Phe' Asp. 
10 65 70 75' , 80 

Asn His Asp lie lie Asn Phe Phe Thr Glu Asn Gly Val Lys Leu Lys 



15 



20 



85 



90 



95 



Val Glu Asp His Gly Arg Val Phe Pro Ala Ser Asp Lys Ser Arg Thr 

100 * 105 * 110 

lie He Glu Ala Leu Glu Lys Lys He Thr Glu Leu Gly Gly Gin Val 
115 120 125 

« 

Ala Thr Gin He Glu lie Val Ser Val Lys Lys Val Asp Asp Gin Phe 
130 135' 140 



25 145 



Val Leu Lys Ser Ala Asp Gin Thr Phe Thr Cys Glu J_.ys Lieu lie Val 

150 ! ' 155 160 



30 



Thr Thr Gly Gly Lys Ser Tyr Pro Ser Thr Gly Ser Thr Gly Phe Gly 

165 170 175 

His Glu He Ala Arg His Phe Lys His Thr He Thr Asp Leu Glu Ala 

180 * 185 190 



35 



Ala Glu Ser Pro Leu Leu Thr Asp Phe Pro His Lys Ala Leu Gin Gly 

195 200 205 

He Ser Leu Asp Asp Val Thr Leu Ser Tyr Gly Lys His Val He Thr 
210 215 220 



His Asp Leu Leu Phe Thr His Phe Gly Leu Ser Gly Pro Ala Ala Leu 
40 225 230 235 240 

Arg Met Ser Ser Phe Val Lys Gly Gly Glu Val Leu Ser Leu Asp Val 

245- 250 255 

45 Leu Pro Gin Leu Ser Glu Lys Asp Leu Val Thr Phe Leu Glu Glu Asn 

260 265 270 



50 



Arg Glu Lys Ser Leu Lys Asn Ala Leu Lys Thr Leu Leu Pro Glu Arg 
275 280 285 

Leu Ala Glu Phe Phe Val Gin Gly Tyr Pro Glu Lys Val Lys Gin Leu 
290 . 295 300 



Thr Glu Lys Glu Arg Glu Gin Leu Val Gin Ser lie Lys Glu Leu Lys 
55 305 310 315 320 



He Pro Val Thr Gly Lys Met Ser Leu Ala Lys Ser Phe Val Thr Lys 
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i 

,325 330 335 

Gly Gly Val Ser Leu Lys felu lie Asn Pro Lys Thr Leu Glu Ser Lys 

340 ' I , 345 V 1 l ' 350 , 

Leu Vai Pro Gly, Leu His Phe Ala Gly Glu Val Met Asp lie Asn- Ala 
355 360 ' 365 ' , 

* 

His Thr Gly Gly Phe Asn He Thr Ser Ala Leu Cys Thr Gly Trp Val 
10 370 375 . 380 

Ala Gly Ser Leu His Tyr Asp 1 1 ' ', 

385 390 , 



15 



20 



30 



45 



<210> 217 
<211> 231 
<2'12> PRT 

<213> Streptococcus pneumoniae , 1 

i i 

<400> 217 

Met Leu Lys Trp Glu Asp Leu Pro Val Glu Met Lys Ser 1 Ser Glu Val 
1 5 10 *15 



25 Glu Ser Tyr Tyr Gin Leu Val Ser Lys Arg Lys Gly Ser Leu He Phe 

20 ' 25 30 



Lys Arg Cys Leu Asp Trp Val Leu Ala Leu Val Leu Thr Trp Val Leu 
35 40 45 

Thr Ser Pro lie Phe Leu lie Leu Ser He Trp He Lys Leu Asp Ser 
50 55 60 



Lys Gly Pro Val lie Tyr Lys Gin Glu Arg Val Thr Gin Tyr Asn Arg 
35 65 70 75 80 

Arg Phe Lys lie Trp Lys Phe Arg Thr Met Val Thr Asp Ala Asp Lys 

85 90 95 

40 Lys Gly Ser Leu Val Thr Ser Ala Asn Asp Ser Arg He .Thr Lys Val 

100 ' 105 110 



Gly Asn Phe lie Arg Arg Val Arg Leu Asp Glu Leu Pro Gin Leu Val 
115 120 125 

Asn Val Leu Lys Gly Glu Met Ser Phe Val Gly Thr Arg Pro Glu Val 
130 135 140 



Pro Arg Tyr Thr Glu Gin Tyr Ser Pro Glu Met Met Ala Thr Leu Leu 
50 145 150 155 160 

Leu Gin Ala Gly lie Thr Ser Pro Ala Ser lie Asn Tyr Lys Asp Glu 

165 ' 170 175 

55 Asp Thr lie lie Ser Gin Met Thr Glu Lys Gly Leu Ser Val Asp Gin 

180 185 190 
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Ala Tyr Val Glu His Val Leu Pro Glu Lys Met Arg Tyr Asn Leu Ala 
195 2d0 205 



Tyr Leu Arg Glu PheiSer Phe Phe Gly Asp lie Lys He Met Phe Gin 
210 ' 215 ' 220 

* * i 

■ i 

Thr Val Phe Glu Val Leu Lys 
225 230 



<210> 218 . " \ 

<211> 140 

<212> PRT ' ■ • i 

<213> . Streptococcus pneumoniae . 

i 

i 1 

<400> 218 

Met Thr Ser Pro Leu Leu Glu Ser Arg Arg Gin Leu Arg Lys Cys Ala 
' 1 5 ,10 15 



20 Phe Gin Ala Leu Met Ser Leu Glu Phe Gly Thr Asp Val Glu Thr Ala 

20 25 30 



Cys' Arg Phe Ala Tyr Thr His Asp Arg Glu Tyr Thr Asp Val Gin Leu 
35 40 45 

Pro Ala Phe Leu He Asp Leu Val Ser Gly Val Gin Ala Lys Lys Glu 
50 55 60 



Glu Leu Asp Lys Gin He Thr Gin His Leu Lys Ala Gly Trp Thr He 
30 65 70 75 80 

Glu Arg Leu Thr Leu Val Glu Arg Asn Leu Leu Arg Leu Gly Val Phe 

.85 90 • .95 

35 Glu He Thr Ser Phe Asp Thr Pro Gin Leu Val Ala Val Asn Glu Ala 

100 105 110 



lie Glu Leu Ala Lys Asp Phe Ser Asp Gin Lys Ser Ala Arg Phe lie 
115 120 125 

, 4 

Asn Gly Leu Leu Ser Gin Phe Val Thr Glu Glu Gin 
130 135 140 



45 <210> 219 
<211> 1179 
<212> PRT 

<213> Streptococcus pneumoniae 
50 <400> 219 

Met Tyr Leu Lys Glu He Glu He Gin Gly Phe Lys Ser Phe Ala Asp 
1 5 10 15 

Lys Thr Lys Val Val Phe Asp Gin Gly Val Thr Ala Val Val Gly Pro 
55 20 25 30 

Asn Gly Ser Gly Lys Ser Asn He Thr Glu Ser Leu Arg Trp Ala Leu 
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35 



40 



45 



Gly Glu Ser Ser Val Lys Ser Leu Arg Gly Gly Lys Met Pro Asp Val 

50 55 ' , 60 ■ . 



i ' 



lie Phe Ala Gly Thr Glu Ser Arg Lys Pro Leu 1 Asn ,Tyr Ala Ser ,Val 

65 70 75 1 80 ' 

Val Val Thr Leu Asp Asn His Asp Gly Phe lie Lys Asp Ala Gly Gin 

85 90 , 95 

Glu lie Arg Val Glu Arg His lie Tyr Arg Ser Gly Asp Ser Glu Tyr , 



100 



105 



110 



Lys lie Asp Gly Lys Lys Val Arg Leu Arg Asp, lie His Asp Leu Phe 
115 120 125 

Leu Asp Thr Gly Leu Gly Arg Asp Ser Phe Ser lie lie Ser Gin Gly 
130 135 140 

Lys Val Glu Glu lie Phe Asn Ser Lys Pro Glu Glu Arg Arg Ala lie 
145 150 155 160 

Phe Glu Glu Ala Ala Gly Val Leii Lys Tyr Lys Thr Arg Arg Lys Glu 

165 | ! 170 175 

Thr Glu Ser Lys Leu Gin Gin Thr Gin Asp Asn Leu Asp Arg Leu Glu 

180 185 190 



i • 



Asp lie lie Tyr Glu Leu Asp Asn Gin lie Lys Pro Leu Glu Lys Gin 
195 200 205 



Ala Glu Asn Ala Arg Lys Phe Leu Asp Leu Glu Gly Gin Arg Lys Ala 
• 210 , 215 220 

lie Tyr Leu Asp Val Leu Val Ala Gin lie Lys Glu Asn Lys Ala Glu 
225 ' 230 235 240 



Leu Glu Ser Thr Glu Glu Glu L6u Ala Gin Val Gin Glu Leu Leu Met 

245 250 255 

Ser Tyr Tyr Gin Lys Arg Glu Lys Leu Glu Glu Glu Asn Gin Thr Leu 

260 265 270 



Lys Lys Gin Arg 
275 

Ser Leu Met Asp 
290 

Ala Leu Ser Lys 
305 

Ala Gin Ala Arg 



Lys Glu Lys Tyr 



Gin Asp Leu Gin 

280 

Leu Thr Ser Leu 
295 

Leu Glu Ser Glu 
310 

■Leu Ala Ala Leu 
325 

Asp Lys Glu Ser 



Ala Glu Met Ala 



lie Ser Asp Leu 

300 

Gin Val Ala Leu 
315 

Glu Asp Lys Arg 
330 

Ser Leu Ala Leu 



Lys Asp Gin Gly 
285 

Glu Arg Lys Leu 



Asn Gin Gin Glu 

320 

Asn Ser Leu Ser 
335 

Leu Glu Gly Asn 



4 
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340 



345 



350 



Leu Val Gin Asn Asn Gin Lys Leu Asn Arg Leu Glu Ala Glu Leu Leu 

355 ' ! , ' 360 ' ' '365 



Ala Phe Ser Asp Asp Pro Asp- Gin Met lie Glu Leu Leu Arg Glu Arg 
370 ,375 ' 380 

Phe Val Ala Leu Leu Gin Glu Glu Ala Asp Val Ser Asn Gin Leu Thr 

10 385 390 ' 395' 1 400 

i 1 . 

>' . - i 

. ■ ' ' I 

Arg lie Glu Asn Glu Leu Glu Asn Ser Arg ( Gin Leu Ser Gin Lys Gin 

405 i 410 415 

15 Ala Asp Gin Leu G^u Lys Leu Lys Glu Gin Leu Ala Thr Ala Lys Glu 

420 . 425 . * 430 



20 



Lys Ala Ser Gin Gin Lys Asp Glu Leu Glu Thr Ala Lys Val Gin Val 

435 440 ' 445 

Gin Lys Leu Leu Ala Asp Tyr. Gin Ala lie Ala Lys Glu Gin Glu Glu , 

450 1 455 460 ' 



Gin Lys Thr Ser Tyr Gin Ala Gin Gin Ser Gin Leu Phe Asp Arg* Leu 
25 465 , 470 475 480 

Asp Ser Leu Lys Asn Lys Gin Ala Arg Ala Gin Ser Leu Glu Asn lie 

485 490 495 

30 Leu Arg Asn His Ser Asn* Phe Tyr Ala Gly Val Lys Ser Val Leu Gin 

500 505 510 



35 



Glu Lys Asp Arg Leu Gly Gly lie lie Gly Ala Val Ser Glu His Leu 
515 520 525 

Thr Phe Asp Val Tyr Tyr Gin Thr Ala Leu Glu lie Ala Leu Gly Ala 
530 535 540 



Ser Ser Gin His lie lie Val Glu Asp Glu Glu Ser Ala Thr Lys Ala 

40 545 550 555 560 

lie Asp Phe Leu Lys Arg Asn Arg Val Gly Arg Ala Thr Phe Leu Pro 

565 570 575 

45 Leu Thr Thr lie Lys Ala Arg Thr lie Ser Ser Gin Asn Gin Asp Ala 

580 585 - 590 



50 



lie Ala Val Ser Pro Gly Phe Leu Gly Met Ala Asp Glu Leu Val Thr 
595 600 605 

Phe Asp Thr Arg Leu Glu Ala He Phe Lys Asn Leu Leu Ala Thr Thr 
610 615 620 



Ala He Phe Asp Thr Val Glu His Ala Arg Glu Ala Ala Arg Gin Val 
55 625 630 635 640 

Arg Tyr Gin Val Arg Met Val, Thr Leu Asp Gly Thr Glu Leu Arg Thr 
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645 



650 



655 



Gly Gly Ser Tyr Ala Gly Gly Ala Asn Arg Gin Asn Asn Ser lie Phe 

660 665 , , 670 . 

lie Lys, Pro Glu Leu Glu Gin Leu Gin Lys Glu 'lie Ala Ala Asp Glu 
675 . 680 , 685 

Ala Ser Leu Gly Ser Glu Glu Ala Ala Leu Lys Thr Leu Gin Asp Gin 
10 690 695 700 



20 



Met Ala Ala Leu Thr Glu Arg Leu Glu Ala lie Lys Ser Gin Gly Glu 



705 



710 



715 



,720 



15 Gin Ala Arg lie Gin Glu Gin Gly Leu Ser Leu Ala Tyr Gin Gin Thr 



725 



730 



735 



i * < 



Ser Gin Gin Val Glu Glu Leu Glu Thr Leu Trp Lys Leu Gin Glu Glu 

740 745 750 

Glu lie Asp Arg Leu Ser Glu Gly Asp Trp Gin Ala Asp Lys Glu Lys 
755 ,760 765 



Cys Gin Glu Ser Leu Ala Thr lie 1 Ala Ser Asp Lys Gin Asn Leu Glu 
25 770 775 ! ■ 780 

Ala Glu lie Glu Glu lie Lys Ser Asn Lys Asn Ala lie GJLn Glu Arg 
785 790 795 800 

i 

30 Tyr Gin Asn Leu' Gin Glu Glu Val Ala Gin Ala Arg Leu Leu Lys Thr 

805 810 815 



35 



Lys Leu Gin Gly Gin Lys Arg Tyr Glu Val Ala Asp lie Glu Arg Leu 

820 825 830 

Gly Lys Glu Leu Asp Asn Leu Asn lie Glu Gin Glu Glu lie Gin Arg 
835 840 845 



Met Leu Gin Glu Lys Val Asp Asn Leu Glu Lys Val Asp Thr Glu Leu 
40 850 855 860 



Leu Ser Gin Gin Ala Glu Glu Ser Lys Thr Gin Lys Thr Asn Leu Gin 
865 870 875 880 

45 Gin Gly Leu lie Arg Lys Gin Phe Glu Leu Asp Asp lie Glu Gly Gin 

88'5 890 895 

Leu Asp Asp lie Ala Ser His Leu Asp Gin Ala Arg Gin Gin Asn Glu 

900 • 905 910 

Glu Trp lie Arg Lys Gin Thr Arg Ala Glu Ala Lys Lys Glu Lys Val 
915 920 925 

Ser Glu Arg Leu Arg His Leu Gin Asn Gin Leu Thr Asp Gin Tyr Gin 
55 930 935 940 

lie Ser- Tyr Thr' Glu Ala Leu Glu" Lys Ala His Glu Leu Glu Asn Leu 



50 



178 
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i 

I 

945 950 ' 955 96Q 

Asn Leu Ala Glu Gin Glu Val Gin Asp Leu Glu Lys Ala lie Arg Ser 

, 965 970 < , . ' 975 ' . , 



5 



20 



35 



45 



50 



Leu Gly Pro Val Asn Leu Glu Ala lie, Asp Gin* Tyr Glu. Glu Val His 

980 985 ' 990 



Asn Arg Leu Asp Phe Leu Asn Sef Gin Arg Asp Asp lie Leu Ser Ala 
10 995 1000, ' 1005 

Lys Asn Leu Leu Leu Glu Thr lie Thr Glu Met Asn Asp Glu Val Lys 



1010 1015 



1020 



15 Glu Arg Phe Lys Ser Thr Phe Glii Ala lie Arg Glu Ser Phe Lys Val 
1025 1030 i 1035 1040 



I 



I/ ■ 



Thr Phe Lys Gin Met Phe Gly Gly Gly Gin Ala Asp Leu lie Leu Thr 

1045 1050 - 1055 

Glu Gly Asp Leu Leu Thr Ala Gly Val Glu lie Ser Val Gin Pro Pro 

1060 1065 1070 



Gly Lys Lys lie Gin Ser Leu Asri Leu Met Ser Gly Gly Glu Lys Ala 
25 1075 1080 ' 1085 

Leu Ser Ala Leu Ala Leu Leu Phe Ser lie lie Arg Val Lys Thr lie 
1090 1095 1100 

30 Pro Phe Val lie Leu Asp Glu Val Glu Ala Ala Leu Asp Glu Ala Asn 
1105 1110 1115 1120 



Val Lys Arg Phe Gly Asp Tyr Leu Asn Arg Phe Asp Lys Asp Ser Gin 

1125 1130 1135 

Phe lie Val Val Thr His Arg Lys Gly Thr Met Ala Ala Ala Asp Ser 

1140 1145 1150 



lie Tyr Gly Val Thr Met Gin Glu Ser Gly Val Ser Lys lie Val Ser 
40 1155 1160 1165 



Val Lys Leu Lys Asp Leu Glu Ser lie Glu Gly 
1170 1175 



<210> 220 
<211> 447 
<212> PRT 

<213> Streptococcus pneumoniae 



<400> 220 

Met Thr Lys Arg Val Thr lie lie Asp Val Lys Asp Tyr Val Gly Gin 
15 10 15 



55 Glu Val Thr lie Gly Ala Trp Val Ala Asn Lys Ser Gly Lys Gly Lys 

20 25 30 
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lie Ala Phe Leu Gin Leu Arg Asp Gly Thr Ala Phe Phe Gin Gly Val 

35 40 45 ' 

Ala Phe Lys Pro Asn ( Phe Val Glu Lys Phe' Gly Glu -Glu Val Gly Leu 

5 50 , 55 ,60 . • 

* T I 

Glu Lys Phe Asp Val lie Lys Arg Leu Ser .Gin Glu Thr Ser Val Tyr 

65 70 75 80 

10 Val Thr Gly lie Val Lys Glu Asp Glu Arg Ser Lys Phe Gly Tyr Glu 

85 1 90 95 



15 



30 



45 



Leu Asp lie Thr Asp lie Glu Val Ile f Gly Glu Ser Gin Asp Tyr Pro 

' 100 ■ 105 t ' 110 

lie Thr Pro Lys Glu His Gly Thr Asp Phe Leu Met Asp Asn Arg His 
115 120 . 125 



Leu Trp Leu, Arg Ser Arg Lys Gin Val Ala Val Leu Gin lie Arg Asn 

20 130 135 140 

Ala lie lie Tyr Ala Thr Tyr Glu Phe Phe Asp Lys Asn f Gly Phe Met 

145 150 155 160 

25 Lys Phe Asp Ser Pro lie " Leu Ser Gly Asn Ala Ala Glu Asp Ser Thr 

165 170 175 



Glu Leu Phe Glu Thr Asp Tyr Phe Gly Thr Pro Ala Tyr Leu Ser Gin 

180 185 190 

Ser Gly Gin Leu Tyr Leu Glu Ala Gly Ala Met Ala Leu Gly Arg Val 
195 200 205 



Phe Asp Phe Gly Pro Val Phe Arg Ala Glu Lys Ser Lys Thr Arg Arg 
35 210 215 220 

His Leu Thr* Glu Phe Trp Met Met Asp Ala Glu Tyr Ser Tyr Leu Thr 
225 230 235 240 

40 His Asp Glu Ser Leu Asp Leu Gin Glu Ala Tyr Val Lys Ala Leu Leu 

245 250 255 



Gin Gly Val Leu Asp Arg Ala Pro Gin Ala Leu Glu Thr Leu Glu Arg 

260 265 270 

Asp Thr Glu Leu Leu Lys Arg Tyr lie Ala Glu Pro Phe Lys. Arg lie 
275 280 285 



Thr Tyr Asp Gin Ala lie Asp Leu Leu Gin Glu His Glu Asn Asp Glu 
50 290 295 300 

Asp Ala Asp Tyr Glu His Leu Glu His Gly Asp Asp Phe Gly Ser Pro 
305 310 315 320 

* * 

55 His Glu Thr Trp lie Ser Asn His Phe Gly Val Pro Thr Phe Val Met 

325 330 335 
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Asn Tyr Pro Ala Ala lie Lys Ala Phe Tyr Met Lys Pro Val Pro, Gly 

•340 1 ( ' 345 350 ' 

t 

i 

Asn Pro Glu Arg Va'l ,Leu Cys Ala Asp Leu ' Leu Ala 'Pro Glu Gly Tyr 
5 355 ' • ' 360 . , 365 ' • 

Gly Glu lie lie Gly Gly Ser Met Arg Glu Glu' Asp Tyr Asp Ala Leu 
370 375 380 

10 Val Ala Lys Met Asp Glu Leu Gly Met Asp Arg Thr Glu Tyr Glu Phe 
385 '390 ' 395 , 1 , 400 



15 



40 



55 



Tyr Leu Asp Leu Arg Lys Tyr Gly Thr, Val Pro His Gly Gly Phe Gly 

' ' 405 , ' 410 , 415 

i ' 

lie Gly lie Glu Arg Met Val Thr Phe Ala Ala Gly Thr Lys His lie 

420 425 ■ 430 



Arg Glu Ala lie Pro Phe Pro Arg Met Leu His Arg lie Lys Pro 
20 435 440 . 445 



<210> 221 
<211> 308 
25 <212> PRT 

i 

<213> Streptococcus pneumoniae 
<400> 221 

Met Ser Glu Lys . Leu Val Glu lie Lys Asp Leu Glu lie Ser Phe Gly 
30 1 5 10 15 

Glu Gly Ser Lys Lys Phe Val Ala Val Lys Asn Ala Asn Phe Phe lie 

20 25 30 

35 Asn Lys Gly Glu Thr Phe Ser Leu Val Gly Glu Ser Gly Ser Gly Lys 

35 40 45 



Thr Thr lie Gly Arg Ala lie He Gly Leu Asn Asp Thr Ser Asn Gly 
50 55 60 

Asp He He Phe Asp Gly Gin Lys He Asn Gly Lys Lys Ser Arg Glu 

65 70 75 80 



Gin Ala Ala Glu Leu He Arg Arg He Gin Met He Phe Gin Asp Pro 
45 85 90 95 

Ala Ala Ser Leu Asn Glu Arg Ala Thr Val Asp Tyr He lie Ser Glu 

100 105 110 

50 Gly Leu Tyr Asn His Arg Leu Phe Lys Asp Glu Glu Glu Arg Lys Glu 

115 120 125 



Lys Val Gin Asn He He Arg Glu Val Gly Leu Leu Ala Glu His Leu 
130 135 140 

Thr Arg Tyr Pro His Glu Phe Ser Gly Gly Gin Arg Gin Arg He Gly 

145 150 155 160 
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10 



25 



50 



lie Ala Arg Ala Leu Val Met Gin Pro Asp Phe Val He Ala Asp Glu 

165 170 175 

Pro He Ser Ala Leu Asp Val Ser Val Arg Ala Gin Val Leu Asn Leu 

180 185 i , , 190 

i 

Leu Lys Lys Phe Gin Lys Glu Leu Gly Leu Thr Tyr Leu Phe He Ala 

195 200 205 

His Asp Leu Ser Val Val Arg Phe' He Ser Asp Arg lie Ala Val lie 

210 215 i 220 



Tyr Lys Gly Val ' He Val Glu Val Ala Glu Thr Glu Glu Leu Phe Asn 

15 225 , 230 j 235 240 

i 

Asn Pro He His Pro Tyr Thr Gin Ala .Leu Leu Ser Ala Val Pro Ilel 

245 j 250 . 255 

20 Pro Asp Pro He Leu Glu Arg Lys Lys Val Leu Lys Val Tyr Asp Pro 

260 : 265 270 



Ser Gin His Asp Tyr Glu Thr Asp Lys Pro Ser Met Val Glu He Arg 
275 280 285 

I 1 

Pro Gly His Tyr Val Trp Ala Asn Gin Thr Glu Leu Ala Arg Tyr Gin 
290 295 300 



Lys Gly Leu Asn 
30 305 



<210> 222 
<211> 424 
35 <212> PRT 

<213> Streptococcus pneumoniae 

<400> 222 

Met Lys He Ser Trp Asn Gly Phe Ser Lys Lys Ser Tyr Gin Glu Arg 
40 1 5 10 15 

Leu Glu Leu Leu Lys Ala Gin Ala Leu Leu Ser Pro Glu Arg Gin Ala 

20 25 30 

45 Ser Leu Glu Lys Asp Glu Gin Met Ser Val Thr Val Ala Asp. Gin Leu 

35 40 . 45 



Ser Glu Asn Val Val. Gly Thr Phe Ser Leu Pro Tyr Ser Leu Val Pro 
50 55 60 

Glu Val Leu Val Asn Gly Gin Glu Tyr Thr Val Pro Tyr Val Thr Glu 

. 65 70 75 80 



Glu Pro Ser Val Val Ala Ala Ala Ser Tyr Ala Ser Lys He He Lys 
55 85 90 95 

Arg Ala Gly Gly Phe Thr Ala Gin Val His Gin Arg Gin Met He Gly 
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100 



105 



110 



Gin Val Ala Leu Tyr Gin lie Ala Asn Pro Lys Leu Ala Gin Glu Lys 
115 120 , 125 ■ 

5 

lie Ala Ser Lys Lys Ala Glu Leu Leu. Glu Leu [ Ala Ash Gin Ala . Tyr 
130 135 • 140. ' 

Pro Ser lie Val Lys Arg Gly Gly Gly Ala Arg Asp Leu His Val Glu 

10 145 150 155' , 1,60 



15 



20 



Gin He Lys Gly Glu Pro Asp Phe Leu Val Val . Tyr lie His Val Asp. 



165 



170 



175 



Thr Gin Glu Ala Met Gly Ala Asn Met Leu Asn Thr Met Leu Glu Ala * 

180 ■ 185 190 

Leu Lys Pro Val Leu Glu Glu Leu Ser Gin Gly Gin Ser* Leu Met Gly ' 
195 200 205 

He Leu Ser Asn Tyr Ala Thr Asp Ser Leu Val Thr Ala Ser Cys Arg 
210 215 220 



He Ala Phe Arg Tyr Leu Ser Arg Gin Lys Asp Gin Giy Arg Glu He 

25 225 230 ! ' 235 240 

Ala Glu Lys He Ala Leu Ala Ser Gin Phe Ala Gin Ala Asp Pro Tyr 

245 250 255 

30 Arg Ala Ala Thr His Asn Lys Gly He Phe Asn Gly He Asp Ala He 

260 265 270 



35 



Leu He Ala Thr Gly Asn Asp Trp Arg Ala He Glu Ala Gly Ala His 
275 280 285 

Ala Phe Ala Ser Arg Asp Gly Arg Tyr Gin Gly Leu Ser Cys Trp Thr 
290 295 300 



Leu Asp Leu Glu Arg Glu Glu Leu Val Gly Glu Met Thr Leu Pro Met 
40 305 310 315 320 

Pro Val Ala Thr Lys Gly Gly Ser He Gly Leu Asn Pro Arg Val Ala 

325 ' 330 335 

45 Leu Ser His Asp Leu Leu Gly Asn Pro Ser Ala Arg Glu Leu Ala Gin 

340 345 350 



50 



He He Val Ser lie Gly Leu Ala Gin Asn Phe Ala Ala Leu Lys Ala 
355 360 365 

Leu Val Ser Thr Gly He Gin Gin Gly His Met Lys Leu Gin Ala Lys 
370 375 380 



Ser Leu Ala Leu Leu Ala Gly Ala Ser Glu Ser Glu Val Ala Pro Leu 
55 385 390 395 400 



Val Glu Arg Leu He Ser Asp Lys Thr Phe Asn Leu Glu Thr Ala Gin 
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,405 410 415 

t J 1 I 

Arg Tyr Leu Glu Asn Leu Arg Ser 

420, , ., . ' '. . . 



<210> 223 1 , ' 

<211> 262 
<212> PRT 

<213> Streptococcus pneumoniae 

' ' . .. 'i 

<400> 223 1 ' 

Met Pro lie Thr Ser Leu Glu He Lys Asp Lys Thr Phe Gly Thr Arg 

1 i i i 5 • 1 10 15 

i i , , 

! ' * 

Phe Arg Gly Phe Asp Pro Glu Glu Val Asp Glu Phe Leu Asp He Val 

20 . 25 , ' 30 

Val Arg Asp Tyr Glu Asp Leu Val Arg Ala Asn. His Asp Lys Asn Leu 
35 40 . 45 

Arg He Lys Ser Leu Glu Glu Arg Leu Ser Tyr Phe Aspi Glu He Lys 
50 55 60 

Asp Ser Leu Ser Gin Ser Val Leu lie Ala Gin Asp Thr Ala Glu Arg 
65 70 75 80 

Val Lys Gin Ala Ala His Glu Arg Ser Asn Asn He He His Gin Ala 

85 90 95 

Glu Gin Asp Ala Gin Arg Leu Leu Glu Glu Ala Lys Tyr Lys Ala Asn 

100 105 110 

Glu lie Leu Arg Gin Ala Thr Asp Asn Ala Lys Lys Val Ala Val Glu 
115 120 125 

Thr Glu Glu Leu Lys Asn Lys Ser Arg Val Phe His Gin Arg Leu Lys 
130 135 140 

Ser Thr He Glu Ser Gin Leu Ala He Val Glu Ser Ser Asp Trp Glu 
145 150 155 * 160 

Asp He Leu Arg Pro Thr Ala Thr Tyr Leu Gin Thr Ser Asp Glu Ala 

165 170 175 

Phe Lys Glu Val Val Ser Glu Val Leu Gly Glu Pro He Pro Ala Pro 

180 185 190 

He Glu Glu Glu Pro He Asp Met Thr Arg Gin Phe Ser Gin Ala Glu 
195 200 205 

Met Ala Glu Leu Gin Ala Arg lie Glu Val Ala Asp Lys Glu Leu Ser 
210 215 220 

Glu Phe Glu Ala Gin He Lys Gin Glu Val Glu Ala Pro Thr Pro Val 
225 230 235 240 
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Val Ser Pro Gin Val Glu Glu Glu Pro Leu Leu lie Gin Leu Ala Gin 

245 250 255 



, Cys f Met Lys Asn Gin Lys 
5 260 



■ i * 



<210> 224 

<211> 575 

10 <212> PRT 

<213> Streptococcus pneumoniae 

' <400> 224 

Met Ser Asn Gly Gin Leu lie Tyr Leu Met Val Ala lie Ala Val lie 

15 ,1 1 5 I 10 15 



' Leu Val Leu Ala Tyr Val Val Ala lie Phe Leu Arg Lys Arg Asn Glu 

20 25 30 

20 Gly Arg Leu Glu Ala Leu Glu Glu Arg Lys* Glu Glu Leu Tyr Asn Leu 

35 40 45 



i.' * 



25 



Pro Val Asn Asp Glu Val Glu Ala Val Lys Asn Met His Leu lie Gly 



50 



60 



Gin Ser Gin Val Ala Phe Arg Glu Trp Asn Gin Lys Trp Val Asp Leu 
65 70 75 80 



Ser Leu Asn Ser Phe Ala Asp lie Glu Asn Asn Leu Phe Glu Ala Glu 
30 85 . 90 95 

Gly Tyr Asn His Ser Phe Arg Phe Leu Lys Ala Ser His Gin He Asp 

100 105 110 

35 Gin He Glu Ser Gin He Thr Leu He Glu Glu Asp He Ala Ala He 

115 120 125 



40 



Arg Asn Ala Leu Ala Asp Leu Glu Lys Gin Glu Ser Lys Asn Ser Gly 
130 135 140 

Arg Val Leu His Ala Leu Asp Leu Phe Glu Glu Leu Gin His Arg Val 
145 150 155 160 



Ala Glu Asn Ser Glu Gin Tyr Gly Gin Ala Leu Asp Glu He Glu Lys 
45 165 170 175 

Gin Leu Glu Asn He Gin Ser Glu Phe Ser Gin Phe Val Thr Leu Asn 

180 185 190 

50 Ser Ser Gly Asp Pro Val Glu Ala Ala Val He Leu Asp Asn Thr Glu 

195 200 205 



55 



Asn His He Leu Ala Leu Ser His He Val Asp Arg Val Pro Ala Leu 

210 215 220 

Va'l Thr Thr Leu. Ser Thr Glu Leu Pro Asp Gin Leu Gin Asp Leu Glu 

225 230 235 240 
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Ala Gly Tyr Arg Lys Leu' lie Asp Ala' Asn Tyr His Phe Val Glu Thr 

245 250 ■ 255 

Asp lie Glu Ala Arg Phe His Leu lieu Tyr Glu Ala Phe L^s Lys Asn 

260.1. 265 270 ' ' 

Gin Glu Asn lie Arg Gin Leu Glu Leu Asp Asn Ala Glu Tyr Glu Asn 
275 280 285 

Gly Gin Ala Gin Glu 'Glu lie Asn Ala ' Leu Tyr ( Asp lie Phe Thr Arg 
290 295 ' ' 300 , 

Glu lie Ala Ala Gin Lys Val Val Glu Asn, Leu Leu Ala Thr Leu Pro 
305 , 310 315. 320 

Thr Tyr Leu Gin His Met Lys Glu Asn. Asn Thr Leu Leu Gly Glu Asp 

325 • 330 ' 335 

i 

lie Ala Arg Leu Asn Lys Thr Tyr Leu Leu Pro Glu Thr Ala Ala Ser 

340 ' 345 350 

*_ i ' 

His Val Arg Arg lie Gin Thr Glu Leu Glu Ser Phe Glu Ala Ala lie 
355 360 365 

Val Glu Val Thr Ser Asn Gin Glu Glu Pro Thr Gin Ala Tyr Ser Val 
370 375 380 

Leu Glu Glu Asn Leu Glu Asp Leu Gin Thr Gin Leu Lys Asp lie. Glu 
385 390 395 400 

Asp Glu Gin lie Ser Val Ser Glu Arg Leu Thr Gin lie Glu Lys Asp 

405 410 415 

Asp lie Asn Ala Arg Gin Lys Ala Asn Val Tyr Val Asn Arg Leu His 

420 425 430 

Thr lie Lys Arg Tyr Met Glu Lys Arg Asn Leu Pro Gly lie Pro Gin 
435 440 445 

Thr Phe Leu Lys Leu Phe Phe Thr Ala Ser Asn Asn Thr Glu Asp Leu 
450 455 460 

Met Val Glu Leu Glu Gin Lys Met lie Asn He Glu Ser Val Thr Arg 
465 470 475 480 

Val Leu Glu He Ala Thr Asn Asp Met Glu Ala Leu Glu Thr Glu Thr 

485 490 495 

Tyr Asn He Val Gin Tyr Ala Thr Leu Thr Glu Gin Leu Leu Gin Tyr 

500 505 510 

Ser Asn Arg Tyr Arg Ser Phe Asp Glu. Arg He Gin Glu Ala Phe Asn 
515 520 525 

Glu Ala Leu Asp He Phe Glu Lys Glu Phe Asp Tyr His Ala Ser Phe 
530 535 540 
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Asp Lys lie Ser Gin Ala Leu Glu Val Ala Glu Pro Gly Val Thr Asn 
545 , 550 , , 555 560 

' i ' • 

i • ■ • ■ 1 

Arg Phe Val Thr Ser Tyr Glu Lys Thr Arg Glu Thr lie Arg Phe 

1 565 570 . . .575 • 



<210> 225 * 

10 <211> 800 

<212> PRT 

<213> Streptococcus pneumoniae 



20 



35 



50 



■ i 



<400> 225 



15 Met Leu lie Ser Tyr Lys Trp Leu Lys Glu Leu Val Asp, lie Asp Val 
1 ' 5 10 15 



Pro Ser Gin Glu Leu Ala Glu Lys Met Ser Thr Thr Gly lie Glu Val 

20 25 30 

Glu Gly Val Glu Ser Pro Ala Ala Gly Leu Ser Lys lie Val Val Gly 
35 40 45 



Glu Val Leu Ser Cys Glu Asp Val Pro Glu Thr His Leu His Val Cys 

25 50 55 60 

Gin Val Asn Val Gly Glu Glu Glu Arg Gin lie Val Cys Gly Ala Pro 

65 70 ,75 80 

■30 Asn Val Arg Ala Gly lie Lys Val Met Val Ala Leu Pro Gly Ala Arg 

85 90 95 



lie Ala Asp Asn Tyr Lys lie Lys Lys Gly Lys lie Arg Gly Leu Glu 

100 105 110 

Ser Leu Gly Met lie Cys Ser Leu Gly Glu Leu Gly lie Ser Asp Ser 
115 120 125 



Val Val Pro Lys Glu Phe Ala Asp Gly lie Gin lie Leu Pro Glu Asp 
40 130 135 140 

Ala Val Pro Gly Glu Glu Val Phe Ser Tyr Leu Asp Leu Asp Asp Glu 
145 150 155 160 

45 lie lie Glu Leu Ser lie Thr Pro Asn Arg Ala Asp Ala Leu Ser Met 

165 170 175 



Cys Gly Val Ala His Glu Val Ala Ala lie Tyr Asp Lys Ala Val Asn 

180 185 190 • 

Phe Lys Glu Phe Thr Leu Thr Glu Thr Asn Glu Ala Ala Ala Asp Ala 
195 200 205 



Leu Ser Val Ser lie Glu Thr Asp Lys Ala Pro Tyr Tyr Ala Ala Arg 
55 210 215" 220 

lie Leu Asp Asn Val Thr lie Ala Pro Ser Pro Gin Trp Leu Gin Asn 
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225 230 1 235 ,240 

Leu Leu Met Asn Glu Gly lie Arg Pro lie Asn Asn Val Val Asp Val 

245 250 , , , 255 - — • 

ci • t 

Thr Asn Tyr lie Leu Leu Tyr Phe Gly Gin Pro* Met His Ala Phe Asp' 

260 - 265 270 

Leu Asp Asn Phe Glu Gly Thr Asp lie Arg Val Arg Glu Ala Arg .Ala 
10 275 280 285 

Gly Glu Lys Leu Val Thr Leu Asp Gly Glu Glu Arg Asp Leu Asp Val 

290 295 j 300 ' 

! 

15 Asn Asp Leu Val lie Thr Val Ala Asp Lys Pro Val Ala Leu Ala Gly 
305 310 315 320 



20 



35 



50 



Val Met Gly Gly Gin Ala Thr Glii lie Ser Glu Lys Ser Ser Arg Val 

325 330 335 

Val Leu Glu Ala Ala Val Phe Asn Gly Lys Ser lie Arg Lys Thr Ser 

340 345 350 



Gly Arg Leu Asn Leu Arg Ser Glu Ser Ser Ser Arg Phe Glu Lys Gly 
25 355 360 365 

He Asn Val Ala Thr Val Asn Glu Ala Leu Asp Ala Ala Ala Ser Leu 
370 375 380 

30 He Ala Glu Leu Ala Gly Ala Thr Val Arg Lys Gly He Val Ser Ala 

385 390 395 400 



Gly Glu Leu Asp Thr Ser Asp Val Glu Val Ser Ser Thr Leu Ala Asp 

405 410 415 

Val Asn Arg Val Leu Gly Thr Glu Leu Ser Tyr Ala Asp Val Glu Asp 

420 425 430 



Val Phe Arg Arg Leu Gly Phe Gly Leu Ser Gly Asn Ala Asp Ser Phe 
40 435 440 445 

Thr Val Arg Val Pro Arg Arg Arg Trp Asp He Thr lie Glu Ala Asp 
450 455 460 

45 Leu Phe Glu Glu He Ala Arg He Tyr Gly Tyr Asp Arg Leu Pro Thr 
465 ~ 470 475 480 



Ser Leu Pro Lys Asp Asp Gly Thr Ala Gly Glu Leu Thr Ala Thr Gin 

485 490 495 

Lys Leu Arg Arg Gin Val Arg Thr He Ala Glu Gly Ala Gly Leu Thr 

500 505 510 



Glu He He Thr Tyr Thr Leu Thr Thr Pro Glu Lys Ala Val Glu Phe 
55 515 520 525 

Thr Ala Gin Pro Ser Asn Leu Thr Glu Leu Met Trp Pro Met Thr Val 
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35 



50 



Gly lie Leu Glu Ala Leu Phe Thr Arg Leu Gly Leu Gin Val Thr Tyr 
625 630 635 640 

Thr Ala Thr Ser Glu lie Ala Ser Leu His Pro Gly Arg Thr Ala Val 

645 ' . "650 655 



lie Ser Leu Gly Asp Gin Val Leu <diy fhe Leu Gly Gin Val His Pro 
25 660 I 665 670 

Val Thr Ala Lys Ala Tyr Asp lie Pro Glu Thr Tyr Val Ala Glu Leu 
675 680 685 

30 Asn Leu Ser Ala lie Glu Ala Ala Leu Gin Pro Ala Thr Pro Phe Val 
690 695 700 



Glu lie Thr Lys Phe Pro Ala Val Ser Arg Asp Val Ala Leu Leu Leu 

705 710 715 720 

Lys Ala Glu Val Thr His Gin Glu Val Val Asp Ala lie Gin Ala Ala 

725 730 735 



Gly Val Lys Arg Leu Thr Asp lie Lys Leu Phe Asp Val Phe Ser Gly 

40 740 745 750 

Glu Lys Leu Gly Leu Gly Met Lys Ser Met Ala Tyr Ser Leu Thr Phe 

755 760 765 

45 Gin Asn Pro Glu Asp Ser Leu Thr Asp Glu Glu Val Ala Arg Tyr Met 
770 775 780 



Glu Lys lie Gin Ala Ser Leu Glu Glu Lys Val Asn Ala Glu Val Arg 
785 790 795 800 



55 <210> 226 
<211> 180 
<212> PRT 
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t . * 

I 

530 535 : , 540 

* 

Asp Arg Ser Val Leu Arg Gin Asn Met lie. Ser Gly lie' Leu Asp Thr 
, 545' 550 555' ' $60' . 

5 ... • . , ( , 

Val Ala Tyr Asn Val Ala Arg Lys Asn, Lys Asn' Leu Ala Leu Tyr Glu, * 

565 570 ' ■ " 575 

lie Gly Lys Val Phe Glu Gin Thr Gly Asn Pro Lys Glu Glu Leu Pro 

10 580 , 585 . 590 

i 

Asn Glu lie Asn Ser Phe Ala Phe Ala Leu Thr Gly. Leu Val Ala Glu 1 , 
■ 1 595 606 605 ' ■ ' * 

+ 

15 Lys Asp Phe Gin Thr Ala Ala Vail Pro Val Asp Phe Phe Tyr Ala Lys 
610 615 J 620 
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<213> Streptococcus pneumoniae 



<400> 226 

Met Leu Glu Asn'Asp^le Lys Lys" Val' Leu' Val Ser > -His Asp Glu lie 
5 1 5 ' ' 10 , 1 15 

i i ■ i 

Thr Glu Ala Ala Lys Lys Leu Gly Ala Gin Leu Thr Lys Asp Tyr Ala 

20 25 30 

10 Gly Lys Asn Pro He Leu -Val Gly He Leu Lys Gly Ser lie Pro Phe 

35 ' 40 1 '. , 45 i ■ • 



» i 

Met Ala Glu Leu Val Lys His He Asp, Thr His lie Glu Met Asp" Phe 
'50 i r ■ 55 ' ^ 60 

' i '' 
Met Met Val Ser Ser Tyr His Gly Gly Thr Ala Ser Ser Gly Val lie 

65 70 ,75 ' 80 



Asn He Lys Gin Asp Val Thr Gin Asp lie Lys Gly Arg His Val Leu 
20 85 90 , 95 

Phe Val Glu Asp lie He Asp Thr Gly Gin Thr Leu Lys' Asn Leu Arg 

100 105 110 

25 Asp Met Phe Lys Ala Arg Glu Ala Ala Ser Val Lys lie Ala Thr Leu 

115 * 120 125 



Leu Asp Lys Pro Glu Gly Arg Val Val Glu He Glu Ala Asp Tyr Thr 
130 135 140 

Cys Phe Thr He Pro Asn Glu Phe Val Val Gly Tyr Gly Leu Asp Tyr 
145 150 155 * 160 



Lys Glu Asn Tyr Arg Asn Leu Pro Tyr He Gly Val Leu Lys Glu Glu 
35 165 170 175 



Val Tyr Ser Asn 

180 



i 
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